设为首页加入收藏
  • 首页
  • Start up
  • 当前位置:首页 >Start up >【】

    【】

    发布时间:2025-10-11 12:45:04 来源:都市天下脉观察 作者:Start up

    Latest

    AI

    Amazon

    Apps

    Biotech & Health

    Climate

    Cloud Computing

    Commerce

    Crypto

    Enterprise

    EVs

    Fintech

    Fundraising

    Gadgets

    Gaming

    Google

    Government & Policy

    Hardware

    Instagram

    Layoffs

    Media & Entertainment

    Meta

    Microsoft

    Privacy

    Robotics

    Security

    Social

    Space

    Startups

    TikTok

    Transportation

    Venture

    More from TechCrunch

    Staff

    Events

    Startup Battlefield

    StrictlyVC

    Newsletters

    Podcasts

    Videos

    Partner Content

    TechCrunch Brand Studio

    Crunchboard

    Contact Us

    Image Credits:Inception
    AI

    Inception emerges from stealth with a new type of AI model

    Marina Temkin 11:00 AM PST · February 26, 2025

    Inception, a new Palo Alto-based company started by Stanford computer science professor Stefano Ermon, claims to have developed a novel AI model based on “diffusion” technology. Inception calls it a diffusion-based large language model, or a “DLM” for short.

    The generative AI models receiving the most attention now can be broadly divided into two types: large language models (LLMs) and diffusion models. LLMs are used for text generation. Meanwhile, diffusion models, which power AI systems like Midjourney and OpenAI’s Sora, are mainly used to create images, video, and audio. 

    Inception’s model offers the capabilities of traditional LLMs, including code generation and question-answering, but with significantly faster performance and reduced computing costs, according to the company.

    Ermon told TechCrunch that he has been studying how to apply diffusion models to text for a long time in his Stanford lab. His research was based on the idea that traditional LLMs are relatively slow compared to diffusion technology.   

    With LLMs, “you cannot generate the second word until you’ve generated the first one, and you cannot generate the third one until you generate the first two,” Ermon said. 

    Ermon was looking for a way to apply a diffusion approach to text because, unlike with LLMs, which work sequentially, diffusion models start with a rough estimate of data they’re generating (e.g. ,a picture), and then bring the data into focus all at once.

    Ermon hypothesized generating and modifying large blocks of text in parallel was possible with diffusion models. After years of trying, Ermon and a student of his achieved a major breakthrough, which they detailed in a research paper published last year.

    Techcrunch event

    Join 10k+ tech and VC leaders for growth and connections at Disrupt 2025

    Netflix, Box, a16z, ElevenLabs, Wayve, Sequoia Capital, Elad Gil — just some of the 250+ heavy hitters leading 200+ sessions designed to deliver the insights that fuel startup growth and sharpen your edge. Don’t miss the 20th anniversary of TechCrunch, and a chance to learn from the top voices in tech. Grab your ticket before Sept 26 to save up to $668.

    Join 10k+ tech and VC leaders for growth and connections at Disrupt 2025

    Netflix, Box, a16z, ElevenLabs, Wayve, Sequoia Capital, Elad Gil — just some of the 250+ heavy hitters leading 200+ sessions designed to deliver the insights that fuel startup growth and sharpen your edge. Don’t miss the 20th anniversary of TechCrunch, and a chance to learn from the top voices in tech. Grab your ticket before Sept 26 to save up to $668.

    San Francisco | October 27-29, 2025 REGISTER NOW

    Recognizing the advancement’s potential, Ermon founded Inception last summer, tapping two former students, UCLA professor Aditya Grover and Cornell professor Volodymyr Kuleshov, to co-lead the company. 

    While Ermon declined to discuss Inception’s funding, TechCrunch understands that the Mayfield Fund has invested.

    Inception has already secured several customers, including unnamed Fortune 100 companies, by addressing their critical need for reduced AI latency and increased speed, Emron said.

    “What we found is that our models can leverage the GPUs much more efficiently,” Ermon said, referring to the computer chips commonly used to run models in production. “I think this is a big deal. This is going to change the way people build language models.”

    Inception offers an API as well as on-premises and edge device deployment options, support for model fine-tuning, and a suite of out-of-the-box DLMs for various use cases. The company claims its DLMs can run up to 10x faster than traditional LLMs while costing 10x less.

    “Our ‘small’ coding model is as good as [OpenAI’s] GPT-4o mini while more than 10 times as fast,” a company spokesperson told TechCrunch. “Our ‘mini’ model outperforms small open-source models like [Meta’s] Llama 3.1 8B and achieves more than 1,000 tokens per second.”

    “Tokens” is industry parlance for bits of raw data. One thousand tokens per second is an impressive speed indeed, assuming Inception’s claims hold up.

    • 上一篇:Online radicalization fuels wave of left
    • 下一篇:Democrats cry over Kimmel but ignored Biden's COVID censorship push

      相关文章

      • Jimmy Kimmel shares photo with Nixon enemies list alumni, TV legend Norman Lear
      • Toma’s AI voice agents have taken off at car dealerships
      • Meghan Markle has made another angel investment
      • Lightspeed backs Indian home services startup Snabbit as the next big consumer trend
      • Trainer recommends targeting 'non
      • Speedata, a chip startup competing with Nvidia, raises a $44M Series B
      • The Nuclear Company raises $51M to develop massive reactor sites
      • CaaStle board confirms financial distress, furloughing employees
      • Police 'made contact' with Tyler Robinson near rifle 'drop point': sources
      • Meet Ponte Labor, a startup matching Hispanic immigrants to jobs using WhatsApp

        随便看看

      • Apple TV+ postpones 'The Savant' after Charlie Kirk assassination
      • Affiniti's 20
      • Acorns acquires family wealth and digital memory platform EarlyBird
      • Amazon’s Zoox begins robotaxi testing in Los Angeles 
      • Fetterman rules out party switch, vows to maintain independent voice
      • 5 days left to claim your exhibitor table for TC All Stage 
      • 3 Days Left: Claim your spot on the Expo floor at TC All Stage
      • Rising star defense tech startup Mach Industries is raising $100 million, sources say
      • Man charged with terroristic threat against Kirk vigil at Texas university
      • SF mayor Daniel Lurie to tech CEOs: 'How can we get you back?'
      • Copyright © 2025 Powered by 【】,都市天下脉观察   辽ICP备198741324484号sitemap