AI Video Startup Decart Achieves 4x Faster Real-Time Video Generation at Half GPU Cost Using AWS Trainium3, Challenging NVIDIA's Inference Dominance

Amazon Web Services has scored a major win for its custom AWS Trainium accelerators after striking a deal with AI video startup Decart, with the partnership seeing Decart optimize its flagship Lucy model on AWS Trainium3 to support real-time video generation CNBC

December 15, 2025
|

Amazon Web Services has scored a major win for its custom AWS Trainium accelerators after striking a deal with AI video startup Decart, with the partnership seeing Decart optimize its flagship Lucy model on AWS Trainium3 to support real-time video generation CNBC. Decart is achieving 4x faster inference for real-time generative video at half the cost of GPUs OpenAI, demonstrating that custom AI accelerators can challenge NVIDIA's dominance in computationally intensive generative AI applications.

Decart is essentially going all-in on AWS, making its models available through the Amazon Bedrock platform, allowing developers to integrate real-time video generation capabilities into almost any cloud application without worrying about underlying infrastructure CNBC. The company has obtained early access to the newly announced Trainium3 processor, capable of outputs of up to 100 fps and lower latency CNBC.

Lucy has a time-to-first-frame of 40ms, meaning it begins generating video almost instantly after prompt, and by streamlining video processing on Trainium, can match the quality of much slower, more established video models like OpenAI's Sora 2 and Google's Veo-3, generating output at up to 30 fps CNBC. By running Lucy on Trainium3, Decart hopes to improve current 30 fps outputs and generate live video at up to 100 FPS while reducing time-to-first frame to less than 40 milliseconds Thriveholdings.

Trainium3 UltraServers deliver up to 4.4x more compute performance, 4x greater energy efficiency, and almost 4x more memory bandwidth than Trainium2 UltraServers, with systems scaling up to 144 Trainium3 chips delivering up to 362 FP8 PFLOPs OpenAI. Built on 3-nanometer technology, each UltraServer delivers 362 FP8 PFLOPs with up to 20.7 TB of HBM3e memory, enabling massive models to train in weeks instead of months Yahoo Finance.

The partnership reflects broader industry movement toward custom AI accelerators as alternatives to NVIDIA GPUs. AI coding startup Poolside is using AWS Trainium2 to train its models with plans to use its infrastructure for inference as well, while Anthropic is hedging its bets by training future Claude models on a cluster of up to one million Google TPUs, and Meta Platforms is reportedly collaborating with Broadcom to develop custom AI processors CNBC. AWS claims Trainium and Google's TPUs offer 50-70% lower cost-per-billion-tokens compared to high-end NVIDIA H100 clusters Yahoo Finance.

Dean Leitersdorf, Decart co-founder and CEO, stated that Trainium3's next-generation architecture delivers higher throughput, lower latency, and greater memory efficiency, allowing the company to achieve up to 4x faster frame generation at half the cost of GPUs CNBC.

Leitersdorf emphasized that generative video is one of the most compute-intensive challenges in AI, and by combining Decart's real-time video models with AWS Trainium3, the partnership is making real-time video generation practical and cost-effective at scale Thriveholdings.

Anthropic's early adoption carries symbolic weight as Amazon holds an $8 billion stake in OpenAI's rival, yet chose Trainium for production workloads, with that endorsement signaling Trainium3 isn't experimental but production-ready and competitive with NVIDIA's flagship offerings Yahoo Finance. Yet NVIDIA's moat remains formidable, with CUDA becoming the industry standard for AI development, and switching to Trainium requiring rewriting code and retraining teams Yahoo Finance.

By generating high-fidelity AI video in real time, Decart says it can power use cases that simply weren't possible before, including live gaming where video clips can be incorporated into open-ended video games to generate environments based on player interactions, and social media applications where influencers can integrate AI video into live streams Thriveholdings.

For organizations spending millions monthly on AI infrastructure, Trainium3's economics are transformational, with the chip delivering over 5x more output tokens per megawatt than previous generations, directly slashing data-center power bills Yahoo Finance. Enterprises evaluating AI infrastructure strategies now face credible alternatives to NVIDIA-exclusive architectures, potentially reducing vendor lock-in risks. Amazon acknowledges reality by announcing Trainium4 will support NVIDIA's NVLink Fusion interconnect technology, enabling mixed deployments within the same racks Yahoo Finance.

The real question isn't whether Amazon can match NVIDIA's raw performance as Trainium3 already does, but whether cost and energy efficiency alone reshape a $50 billion+ AI chip market, or whether ecosystem lock-in and customer inertia keep NVIDIA entrenched Yahoo Finance. Decision-makers should monitor whether real-time video generation adoption validates custom accelerator economics across other computationally intensive AI applications. While ASICs aren't going to replace GPUs completely as flexibility of GPUs means they remain the only real option for general-purpose models, specialized workload optimization may fragment AI infrastructure markets CNBC.

Source & Date

Source: Artificial Intelligence News, AWS, Tech Startups, HPCwire, TechCrunch, Invezz
Date: December 3, 2025 (AWS re:Invent 2025

  • Featured tools
Hostinger Website Builder
Paid

Hostinger Website Builder is a drag-and-drop website creator bundled with hosting and AI-powered tools, designed for businesses, blogs and small shops with minimal technical effort.It makes launching a site fast and affordable, with templates, responsive design and built-in hosting all in one.

#
Productivity
#
Startup Tools
#
Ecommerce
Learn more
Wonder AI
Free

Wonder AI is a versatile AI-powered creative platform that generates text, images, and audio with minimal input, designed for fast storytelling, visual creation, and audio content generation

#
Art Generator
Learn more

Learn more about future of AI

Join 80,000+ Ai enthusiast getting weekly updates on exciting AI tools.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

AI Video Startup Decart Achieves 4x Faster Real-Time Video Generation at Half GPU Cost Using AWS Trainium3, Challenging NVIDIA's Inference Dominance

December 15, 2025

Amazon Web Services has scored a major win for its custom AWS Trainium accelerators after striking a deal with AI video startup Decart, with the partnership seeing Decart optimize its flagship Lucy model on AWS Trainium3 to support real-time video generation CNBC

Amazon Web Services has scored a major win for its custom AWS Trainium accelerators after striking a deal with AI video startup Decart, with the partnership seeing Decart optimize its flagship Lucy model on AWS Trainium3 to support real-time video generation CNBC. Decart is achieving 4x faster inference for real-time generative video at half the cost of GPUs OpenAI, demonstrating that custom AI accelerators can challenge NVIDIA's dominance in computationally intensive generative AI applications.

Decart is essentially going all-in on AWS, making its models available through the Amazon Bedrock platform, allowing developers to integrate real-time video generation capabilities into almost any cloud application without worrying about underlying infrastructure CNBC. The company has obtained early access to the newly announced Trainium3 processor, capable of outputs of up to 100 fps and lower latency CNBC.

Lucy has a time-to-first-frame of 40ms, meaning it begins generating video almost instantly after prompt, and by streamlining video processing on Trainium, can match the quality of much slower, more established video models like OpenAI's Sora 2 and Google's Veo-3, generating output at up to 30 fps CNBC. By running Lucy on Trainium3, Decart hopes to improve current 30 fps outputs and generate live video at up to 100 FPS while reducing time-to-first frame to less than 40 milliseconds Thriveholdings.

Trainium3 UltraServers deliver up to 4.4x more compute performance, 4x greater energy efficiency, and almost 4x more memory bandwidth than Trainium2 UltraServers, with systems scaling up to 144 Trainium3 chips delivering up to 362 FP8 PFLOPs OpenAI. Built on 3-nanometer technology, each UltraServer delivers 362 FP8 PFLOPs with up to 20.7 TB of HBM3e memory, enabling massive models to train in weeks instead of months Yahoo Finance.

The partnership reflects broader industry movement toward custom AI accelerators as alternatives to NVIDIA GPUs. AI coding startup Poolside is using AWS Trainium2 to train its models with plans to use its infrastructure for inference as well, while Anthropic is hedging its bets by training future Claude models on a cluster of up to one million Google TPUs, and Meta Platforms is reportedly collaborating with Broadcom to develop custom AI processors CNBC. AWS claims Trainium and Google's TPUs offer 50-70% lower cost-per-billion-tokens compared to high-end NVIDIA H100 clusters Yahoo Finance.

Dean Leitersdorf, Decart co-founder and CEO, stated that Trainium3's next-generation architecture delivers higher throughput, lower latency, and greater memory efficiency, allowing the company to achieve up to 4x faster frame generation at half the cost of GPUs CNBC.

Leitersdorf emphasized that generative video is one of the most compute-intensive challenges in AI, and by combining Decart's real-time video models with AWS Trainium3, the partnership is making real-time video generation practical and cost-effective at scale Thriveholdings.

Anthropic's early adoption carries symbolic weight as Amazon holds an $8 billion stake in OpenAI's rival, yet chose Trainium for production workloads, with that endorsement signaling Trainium3 isn't experimental but production-ready and competitive with NVIDIA's flagship offerings Yahoo Finance. Yet NVIDIA's moat remains formidable, with CUDA becoming the industry standard for AI development, and switching to Trainium requiring rewriting code and retraining teams Yahoo Finance.

By generating high-fidelity AI video in real time, Decart says it can power use cases that simply weren't possible before, including live gaming where video clips can be incorporated into open-ended video games to generate environments based on player interactions, and social media applications where influencers can integrate AI video into live streams Thriveholdings.

For organizations spending millions monthly on AI infrastructure, Trainium3's economics are transformational, with the chip delivering over 5x more output tokens per megawatt than previous generations, directly slashing data-center power bills Yahoo Finance. Enterprises evaluating AI infrastructure strategies now face credible alternatives to NVIDIA-exclusive architectures, potentially reducing vendor lock-in risks. Amazon acknowledges reality by announcing Trainium4 will support NVIDIA's NVLink Fusion interconnect technology, enabling mixed deployments within the same racks Yahoo Finance.

The real question isn't whether Amazon can match NVIDIA's raw performance as Trainium3 already does, but whether cost and energy efficiency alone reshape a $50 billion+ AI chip market, or whether ecosystem lock-in and customer inertia keep NVIDIA entrenched Yahoo Finance. Decision-makers should monitor whether real-time video generation adoption validates custom accelerator economics across other computationally intensive AI applications. While ASICs aren't going to replace GPUs completely as flexibility of GPUs means they remain the only real option for general-purpose models, specialized workload optimization may fragment AI infrastructure markets CNBC.

Source & Date

Source: Artificial Intelligence News, AWS, Tech Startups, HPCwire, TechCrunch, Invezz
Date: December 3, 2025 (AWS re:Invent 2025

Promote Your Tool

Copy Embed Code

Similar Blogs

January 2, 2026
|

Top 10 AI‑Driven Marketing Tools Transforming Growth in 2026

Marketing has entered a new era powered by artificial intelligence. From automating repetitive tasks and personalizing campaigns to generating content and uncovering deep audience insights.
Read more
January 2, 2026
|

Top 10 AI Manufacturing Platforms in 2026

Artificial intelligence is transforming manufacturing by enabling predictive maintenance, quality control, supply chain optimization, and autonomous operations. AI platforms help manufacturers boost productivity.
Read more
January 2, 2026
|

Top 10 AI Events You Should Know in 2026

Artificial intelligence is evolving rapidly, and staying ahead requires connecting with experts, exploring new research, and sharing ideas with peers. AI conferences and events are where breakthroughs are announced.
Read more
January 2, 2026
|

Top 10 AI Influencers Shaping the Future of Artificial Intelligence in 2026

Artificial intelligence isn’t just about algorithms and data it’s about the people who drive research, influence policy, and shape how AI is adopted across industries and society.
Read more
January 2, 2026
|

Top 10 Global AI Consulting Firms Leading Digital Transformation in 2026

Artificial intelligence is reshaping industries, from healthcare and finance to manufacturing and retail. Successful AI adoption requires strategic vision, deep technical expertise, and effective change management.
Read more
January 2, 2026
|

Top 10 Customer Service AI Platforms in 2026

Customer expectations are higher than ever. They demand fast, accurate support 24/7 across multiple channels. To meet these demands efficiently, businesses are increasingly relying on AI-powered.
Read more