    Published: 2025-09-15 09:30:40 Source: 都市天下脉观察 Author: Start up

    Image Credits:Inception
    AI

    Inception emerges from stealth with a new type of AI model

    Marina Temkin 11:00 AM PST · February 26, 2025

    Inception, a new Palo Alto-based company started by Stanford computer science professor Stefano Ermon, claims to have developed a novel AI model based on “diffusion” technology. Inception calls it a diffusion-based large language model, or a “DLM” for short.

    The generative AI models receiving the most attention now can be broadly divided into two types: large language models (LLMs) and diffusion models. LLMs are used for text generation. Meanwhile, diffusion models, which power AI systems like Midjourney and OpenAI’s Sora, are mainly used to create images, video, and audio. 

    Inception’s model offers the capabilities of traditional LLMs, including code generation and question-answering, but with significantly faster performance and reduced computing costs, according to the company.

    Ermon told TechCrunch that he has been studying how to apply diffusion models to text for a long time in his Stanford lab. His research was based on the idea that traditional LLMs are relatively slow compared to diffusion technology.   

    With LLMs, “you cannot generate the second word until you’ve generated the first one, and you cannot generate the third one until you generate the first two,” Ermon said. 

    Ermon wanted to apply a diffusion approach to text because diffusion models do not work sequentially the way LLMs do. Instead, they start with a rough estimate of the data they’re generating (e.g., a picture) and then bring the data into focus all at once.

    Ermon hypothesized that generating and modifying large blocks of text in parallel was possible with diffusion models. After years of trying, Ermon and a student of his achieved a major breakthrough, which they detailed in a research paper published last year.
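To make the contrast concrete, here is a toy sketch of the two generation styles Ermon describes. It is not Inception's method or any real model: the "model" simply copies from a known target sentence, and the function names, the `MASK` placeholder, and the `num_steps` parameter are all assumptions made for illustration. The only point is the step count: one sequential step per token for left-to-right decoding, versus a fixed number of passes that each fill in many positions at once.

```python
from math import ceil

MASK = "_"  # hypothetical placeholder for a not-yet-generated position


def autoregressive_decode(target):
    """Emit one token per step, left to right. Returns (tokens, steps)."""
    out = []
    steps = 0
    for tok in target:  # token i cannot be emitted before tokens 0..i-1
        out.append(tok)
        steps += 1
    return out, steps


def parallel_refine_decode(target, num_steps):
    """Start fully masked and fill a whole batch of positions per step.

    Takes at most num_steps passes, regardless of sequence length.
    """
    n = len(target)
    if n == 0:
        return [], 0
    out = [MASK] * n
    per_step = ceil(n / num_steps)  # positions resolved in each pass
    steps = 0
    for start in range(0, n, per_step):
        for i in range(start, min(start + per_step, n)):
            out[i] = target[i]  # a real model would predict these jointly
        steps += 1
    return out, steps


if __name__ == "__main__":
    sentence = "the quick brown fox jumps over the lazy dog".split()
    _, ar_steps = autoregressive_decode(sentence)
    _, par_steps = parallel_refine_decode(sentence, num_steps=3)
    print(f"sequential steps: {ar_steps}, parallel passes: {par_steps}")
```

In a real system each parallel pass costs more than a single sequential step, so the step count alone does not determine the speedup. The sketch shows only why removing the one-token-at-a-time dependency can help.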

    Recognizing the advancement’s potential, Ermon founded Inception last summer, tapping two former students, UCLA professor Aditya Grover and Cornell professor Volodymyr Kuleshov, to co-lead the company. 

    While Ermon declined to discuss Inception’s funding, TechCrunch understands that the Mayfield Fund has invested.

    Inception has already secured several customers, including unnamed Fortune 100 companies, by addressing their critical need for reduced AI latency and increased speed, Ermon said.

    “What we found is that our models can leverage the GPUs much more efficiently,” Ermon said, referring to the computer chips commonly used to run models in production. “I think this is a big deal. This is going to change the way people build language models.”

    Inception offers an API as well as on-premises and edge device deployment options, support for model fine-tuning, and a suite of out-of-the-box DLMs for various use cases. The company claims its DLMs can run up to 10x faster than traditional LLMs while costing 10x less.

    “Our ‘small’ coding model is as good as [OpenAI’s] GPT-4o mini while more than 10 times as fast,” a company spokesperson told TechCrunch. “Our ‘mini’ model outperforms small open-source models like [Meta’s] Llama 3.1 8B and achieves more than 1,000 tokens per second.”

    “Tokens” is industry parlance for bits of raw data. One thousand tokens per second is an impressive speed indeed, assuming Inception’s claims hold up.
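For a rough sense of what that throughput means in practice, the arithmetic below converts tokens per second into wall-clock time for a single response. The 1,000 tokens-per-second figure is the company's claim quoted above. The 500-token response length and the 100 tokens-per-second baseline (the claimed rate divided by the claimed 10x) are assumptions chosen for illustration, not measured numbers.

```python
def generation_seconds(num_tokens, tokens_per_second):
    """Time to emit num_tokens at a steady rate, ignoring startup latency."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    response_tokens = 500   # assumed response length
    claimed_rate = 1000     # tokens/s, per Inception's claim
    baseline_rate = 100     # tokens/s, hypothetical 10x-slower baseline

    fast = generation_seconds(response_tokens, claimed_rate)
    slow = generation_seconds(response_tokens, baseline_rate)
    print(f"claimed rate: {fast:.1f} s, baseline: {slow:.1f} s")
```

Under these assumptions, a 500-token answer would take half a second instead of five seconds.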
