-
Spain women's star Putellas to join London City Lionesses
-
WNBA suspends Thomas for fist to Clark's throat
-
England showing Premier League edge at World Cup: Eze
-
UK'S King Charles breaks precedent to reveal £30 mn paid in taxes since 2022
-
Nasdaq falls again on mixed day for US stocks, oil prices rise
-
Yoon grabs early Women's PGA Championship lead with Korda in hunt
-
France squad look to do grieving Deschamps proud in final World Cup group game
-
Will Taylor Swift and Travis Kelce wed in New York? Clues abound
-
Mayweather's Athens fight with Zambidis is off: report
-
Lawyer says Vondrousova 'should appeal' against four-year ban
-
Alonso committed to Aston Martin, but keeping options open
-
Hospitals raise alert as heatwave slams Europe
-
Events cancelled, records loom as heatwave reaches Germany
-
'Alligator Alcatraz' detention center shuts in US: official
-
Czech striker Schick ends international career
-
Tennis great Evert says 'relentless' cancer has returned
-
US says wants deal with Iran, but not 'at any price'
-
Colombian president-elect gives armed groups one month to surrender
-
US Supreme Court hands win to Bayer in weedkiller litigation
-
Apple raises prices for MacBooks and iPads, as costs soar over AI
-
Dominant Osaka sails into Bad Homburg semis
-
UK suffers as heat breaks new June record
-
US Supreme Court says asylum seekers can be turned away before border
-
Binance to suspend crypto services in several EU countries
-
Olivia Wilde looks at evolving relationships in 'The Invite'
-
Hamilton reveals neck injury that hampered debut year with Ferrari
-
Rows, drones and 'sorry' Son as South Korea await World Cup fate
-
Noosha Aubel and Dietmar Woidke: How Potsdam Is Letting Down a Young Child with Profound Disabilities
-
Greek families receive keepsakes of Holocaust victims
-
Antonelli welcomes Mercedes upgrade ast Russell says beware Hamilton
-
Easyjet rejects latest takeover bid but leaves door ajar
-
HRW denounces Turkey arrests ahead of NATO summit
-
Macron hosts Meloni for Riviera talks after Trump rift
-
Alonso committed to Aston Martin, but is keeping options open
-
US Supreme Court paves way for mass deportation of Haitians, Syrians
-
Venezuelans trapped alive after twin quakes kill at least 164
-
South Africa vows firm response to anti-migrant violence
-
New Zealand make England toil as Stokes returns for series decider
-
Poland, Ukraine hold key Gdansk conference without Zelensky
-
Americans impacted by climate change demand answers from lawmakers
-
Massive police deployment blocks Kenya protest anniversary
-
Heat-struck Italians cool off in ancient stone 'trulli'
-
Court orders TotalEnergies to account for clients' emissions
-
French teaching unions call strike over 'unacceptable' heat
-
US Fed's preferred inflation gauge hits fresh three-year high
-
Venezuela twin quakes kill at least 164 with many trapped under rubble
-
Dominant Osaka cruises into Bad Homburg semis
-
IOC votes to continue ski mountaineering for 2030 Games
-
New Zealand frustrate England as Stokes returns for series decider
-
Stocks rally on AI optimism after Micron's blowout forecast
Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks
From edge inference to NVIDIA STX, purpose-built KV cache infrastructure for consistent performance at scale.
SUNNYVALE, CA / ACCESS Newswire / April 21, 2026 / Graid Technology, the pioneer in GPU-accelerated NVMe storage, today announced its Agentic AI Storage Portfolio: a purpose-built family of KV cache solutions designed to eliminate the storage bottleneck that stalls "always-on" production AI. The portfolio spans three deployment tiers: KV Cache Server, KV Cache Rack, and KV Cache Platform, all built on SupremeRAID™ technology. KV Cache Platform, the portfolio's highest tier, is purpose-aligned to NVIDIA's STX reference architecture, with native BlueField-4 DPU execution on the roadmap for H2 2026.
As agentic AI moves from experimentation to production, the infrastructure assumptions that underpinned single-shot inference have broken down. Models running continuous multi-step tasks and maintaining context across hours of operation generate KV cache demands that overwhelm GPU HBM. The result: latency spikes up to 18x, GPU utilization as low as 50%, and model-level failures, including hallucinations and reasoning degradation, that are difficult to detect and costly to recover from.
SupremeRAID™ addresses this directly, aggregating up to 32 NVMe drives into a single 280 GB/s virtual pool, bypassing the CPU via GPU Direct Storage, and delivering KV cache reads at 1.3ms- 77x faster than standard NVMe. The three portfolio tiers bring this capability to every deployment scale:
KV Cache Server - single-node NVMe acceleration for individual inference servers and edge AI deployments. Available now.
KV Cache Rack - rack-scale, partner-validated solutions co-engineered with leading server OEM partners for enterprise multi-GPU clusters. Available now.
KV Cache Platform - Purpose-built for NVIDIA's STX reference architecture, with native BlueField-4 DPU execution and rack-scale storage expansion on the roadmap.
"A year ago, at GTC 2025, Jensen Huang predicted that storage would become GPU-accelerated for the first time. This year, NVIDIA turned that concept into an architecture with STX and CMX," said Leander Yu, CEO of Graid Technology. "Our KV Cache Portfolio is built for precisely this moment, delivering the storage performance that agentic AI demands, at storage-tier economics."
For enterprises and infrastructure teams evaluating agentic AI deployments, the full deployment architecture, technical specifications, and NVIDIA STX compatibility details are available in the solution brief: Graid Technology Agentic AI Storage Portfolio: Purpose-built KV Cache Solutions for Inference at Scale
To learn more about Graid Technology's AI offerings, visit graidtech.com/ai
Media Inquiries:
Andrea Eaken, Sr. Director of Marketing, Americas & EMEA
[email protected]
____________________________________
About Graid Technology
Graid Technology is building the storage backbone for the future of AI, enterprise, and high-performance computing. As the creator of SupremeRAID™, the world's first and only GPU-based RAID, and the global steward of Intel® Virtual RAID on CPU (Intel® VROC), Graid Technology delivers flexible RAID solutions that maximize NVMe performance while ensuring resilient, scalable data protection for modern data infrastructure. Headquartered in Silicon Valley with global operations and R&D in Taiwan, Graid Technology is advancing RAID innovation for the next generation of data-intensive workloads. To learn more, visit graidtech.com.

SOURCE: Graid Technology Inc.
View the original press release on ACCESS Newswire
T.Vitorino--PC