AI inference

21 articles
BenzingaBenzinga··Erica Kollmann

Chinese Tech Giants Pay $1M Per Nvidia B300 as Export Controls Fuel Gray Market

Chinese firms pay ~$1M per Nvidia B300 server—double U.S. prices—amid export restrictions, while China's AI token usage surges to 32% globally.
NVDAAMDBABASMCIAI chipsChina market
The Motley FoolThe Motley Fool··Harsh Chauhan

Nvidia Reclaims $5T Milestone as Path to $10T Valuation Comes Into View

$NVDA regains $5 trillion market cap with potential to become world's first $10 trillion company within three years, driven by AI inference dominance.
NVDAMETAdata centerearnings growth
BenzingaBenzinga··Akanksha Bakshi

Nebius Bets $643M on Eigen to Dominate AI Inference Market

Nebius acquires Eigen AI for $643M to strengthen AI inference capabilities. Stock rises 2.97% in premarket trading despite elevated 20.42% short interest.
NBISacquisitionearnings
BenzingaBenzinga··Erica Kollmann

Rambus Stock Tumbles 10% on EPS Miss and Weak Q2 Guidance Despite Revenue Beat

Rambus stock plummets 10.48% after Q1 EPS miss and weak Q2 guidance, despite beating revenue expectations.
NVDAMSFTAAPLRMBSdata centerguidance
The Motley FoolThe Motley Fool··Rich Smith

Micron Surges on Intel's AI Pivot: Why Memory Demand Could Be Next

$MU gains 4.5% as Intel's earnings beat signals strong AI inference demand, bolstering prospects for high-bandwidth memory chips.
NVDAMUINTCvaluationguidance
GlobeNewswire Inc.GlobeNewswire Inc.··Na

Digital Realty to Invest Nearly S$7B in Singapore, Positioning City as Asia Pacific AI Hub

Digital Realty commits S$7 billion to Singapore data center expansion and AI innovation, positioning the city as Asia Pacific's infrastructure hub.
DLRDLRpJDLRpKDLRpLAI infrastructurecloud infrastructure
The Motley FoolThe Motley Fool··Justin Pope

Nvidia vs. Broadcom: AI Chip Giants Diverge as Inference Era Begins

Nvidia and Broadcom lead AI chip market as focus shifts from training to inference. Nvidia targets $1T sales by 2027; Broadcom forecasts $100B+ AI revenue. Similar valuations, different growth profiles.
NVDAMETAGOOGGOOGLAVGOagentic AIGPU chips
The Motley FoolThe Motley Fool··Billy Duberstein

Google's TurboQuant Sparks Memory Stock Selloff, But Bullish Case Remains Strong

Google's TurboQuant compression tech cuts DRAM needs 6x and boosts speeds 8x, rattling memory stocks like $MU despite likely long-term demand benefits.
MUGOOGGOOGLsemiconductor demandAI inference
The Motley FoolThe Motley Fool··Manali Pradhan, Cfa

Micron Stock Plunges 18% Despite Stellar Earnings: A Buying Opportunity?

Micron stock fell 18% post-earnings despite 196% revenue and 682% EPS growth, driven by pricing and AI compression concerns. Supply tightness expected through 2026.
MUGOOGGOOGLdata centercapital expenditure
The Motley FoolThe Motley Fool··Timothy Green

Intel's $949 Arc Pro B70 GPU Challenges Nvidia's Dominance in Local AI Computing

Intel launches Arc Pro B70 GPU at $949, undercutting Nvidia and AMD alternatives by hundreds of dollars to capture growing local AI workload market.
NVDAAMDINTCGPUAI inference
The Motley FoolThe Motley Fool··Adam Spatacco

Nvidia's $1T Order Pipeline Signals AI Inference Boom, But Market Stays Unmoved

Nvidia's $1T chip order pipeline through 2027 hints at massive AI inference opportunity, but muted stock reaction reflects already-high expectations and valuation pressures.
NVDAAMDMSFTAMZNGOOG+2hyperscalersvaluation
The Motley FoolThe Motley Fool··Keithen Drury

Broadcom's AI Chips Surge 140% as Custom Accelerators Drive $100B Revenue Target

Broadcom's custom AI accelerators grow 140% annually, targeting $100B+ revenue by 2027. Trading at 30x forward earnings, the premium valuation reflects faster growth than Nvidia in the expanding AI chip market.
NVDAAVGOAI chipshyperscalers
The Motley FoolThe Motley Fool··Danny Vena, Cpa

Nvidia Reignites China AI Chip Push With H200 Production Restart

Nvidia restarts H200 chip production for China following regulatory approval, potentially unlocking a $50 billion market and boosting growth.
NVDAAI chipssemiconductor
Investing.comInvesting.com··Ali Merchant

Nvidia's $1T Revenue Bet: Can AI Inference Live Up to the Hype?

Nvidia forecasts $1 trillion revenue by 2027 via AI inference, but investor skepticism persists amid intensifying competition and custom chip threats.
NVDAAMDIBMINTCcompetitionAI inference
BenzingaBenzinga··Business Wire

Penguin Solutions Launches OriginAI Factory to Tackle GPU Memory Bottleneck in Enterprise AI

Penguin Solutions launches OriginAI Factory Platform integrating MemoryAI and ICE ClusterWare with NVIDIA GPUs to optimize enterprise AI inference, targeting financial services, healthcare, and retail sectors.
NVDAPENGenterprise AIAI inference
BenzingaBenzinga··Business Wire

Penguin Solutions Launches First Production CXL Memory Server to Solve AI Inference Bottleneck

Penguin Solutions debuts MemoryAI, an 11TB CXL-based KV cache server offering 10x faster AI inference speeds than NVMe, compatible with NVIDIA's architecture.
NVDAPENGagentic AIenterprise AI
GlobeNewswire Inc.GlobeNewswire Inc.··Vci Global Limited

VCI Global Launches AI Compute Treasury Strategy Built on NVIDIA GPU Infrastructure

$VCIG launches AI Compute Treasury strategy to accumulate NVIDIA GPU infrastructure, targeting growing AI inference demand in market projected to reach $394.5B by 2030.
NVDAVCIGAI infrastructuregenerative AI
GlobeNewswire Inc.GlobeNewswire Inc.··Vci Global Limited

VCIG Unveils AI Compute Treasury Strategy Backed by NVIDIA Blackwell GPUs

VCIG Global launches AI Compute Treasury strategy to accumulate NVIDIA GPU infrastructure, supporting enterprise AI inference workloads through scalable revenue reinvestment model.
NVDAVCIGAI infrastructuregenerative AI
Investing.comInvesting.com··Jeffrey Neal Johnson

CoreWeave's Perplexity Deal Signals Seismic Shift From AI Training to Inference

CoreWeave's Perplexity deal validates shift from AI training to inference workloads. $66.8B backlog and $30-35B 2026 capex support analyst targets of $124.34.
NVDACRWVcloud infrastructureGPU computing
The Motley FoolThe Motley Fool··Geoffrey Seiler

Inference Market Set to Reshape AI Chip Hierarchy as Spending Reaches $700B

AI inference spending surges toward $700B by 2026, challenging Nvidia's dominance as cloud providers develop custom chips and competitors like Broadcom and AMD gain ground.
NVDAAMDGOOGGOOGLTSM+1hyperscalersTensor Processing Units