- 211.@GoodFaithOnly @binarybits When I say AGI I mean a "feature complete" build of a remote worker with a lot of 9s (think: a bit like current Autopilot FSD, maybe plus a few more iterations).›Andrej Karpathy (AI researcher) · LLM score 70 · 3 months ago
- 212.Second @simonbatzner.›Xiang Fu (Researcher at Periodic Labs) · LLM score 80 · 3 months ago
- 213.@lineardiff @ericzelikman I don’t view that as incompatible.›Noam Brown (OpenAI Research Scientist) · LLM score 20 · 3 months ago
- 214.new paper "ATLAS" – for details: https://t.co/MJO9mvlS90; it also covers a new scaling law, pretrain vs finetune for multilingual & more :)›Niklas Muennighoff (AI Researcher at Stanford) · LLM score 70 · 3 months ago
- 215.To scale data-constrained LLMs, repeating & denoising objectives can help.›Niklas Muennighoff (AI Researcher at Stanford) · LLM score 85 · 3 months ago
- 216.State space architecture for state of the art voice model! https://t.co/QjdOdrIFSMTri Dao (Chief Scientist at Together) · LLM score 80 · 3 months ago
- 217.To adapt to today's world, crank down discount factor and crank up learning rateXiang Fu (Researcher at Periodic Labs) · LLM score 75 · 3 months ago
- 218.Happy to share a new paper! Designing model behavior is hard -- desirable values often pull in opposite directions.›John Schulman · LLM score 80 · 3 months ago
- 219.Every AI researcher should read two essays.›Xiang Fu (Researcher at Periodic Labs) · LLM score 80 · 3 months ago
- 220.Below is a deep dive into why self play works for two-player zero-sum (2p0s) games like Go/Poker/Starcraft but is so much harder to use in "real world" domains.›Noam Brown (OpenAI Research Scientist) · LLM score 80 · 3 months ago
- 221.@pli_cachete @karpathy Because it means self play converges to a minimax equilibrium which is essentially an unbeatable strategy.›Noam Brown (OpenAI Research Scientist) · LLM score 80 · 3 months ago
- 222.Self play works so well in chess, go, and poker because those games are two-player zero-sum.›Noam Brown (OpenAI Research Scientist) · LLM score 75 · 3 months ago
- 223..@Stanford courses are high-quality but the policies are definitely outdated.›Noam Brown (OpenAI Research Scientist) · LLM score 20 · 3 months ago
- 224.Awesome to see Veo 3.1 top the LMArena video leaderboards by a large distance with big improvements over Veo 3.0 for text-to-video (+30) and image-to-video (+70)! 🔥Huge congrats to the team! Try it for yourself in https://t.co/QgTpxTL8DQ and the @GeminiApp https://t.co/nQ14ZnkKnrDemis Hassabis (CEO of DeepMind) · LLM score 20 · 3 months ago
- 225.@fchollet Didn't you say just two months ago that you think AGI is about 5 years away?›Noam Brown (OpenAI Research Scientist) · LLM score 20 · 3 months ago
- 226.Super excited to be collaborating with Commonwealth Fusion Systems @CFS_energy to use AI to accelerate fusion development - and move closer to a sustainable future with limitless clean energy https://t.co/kCS1UKEtjPDemis Hassabis (CEO of DeepMind) · LLM score 70 · 4 months ago
- 227.Veo 3 is the state-of-the-art in video models.›Demis Hassabis (CEO of DeepMind) · LLM score 30 · 4 months ago
- 228.We processed over 1.3 Quadrillion tokens last month - that's 1,300,000,000,000,000 tokens! or to put it another way that's 500M tokens a second or 1.8 Trillion tokens an hour...›Demis Hassabis (CEO of DeepMind) · LLM score 75 · 4 months ago
- 229.Some generous companies in Toronto are funding three lectures on AI safety by Owain Evans on Nov 10, 11, 12.›Geoffrey Hinton · LLM score 60 · 4 months ago
- 230.This work, led by @_junxiong_wang and @ben_athi, is a first step towards building AI systems that evolve and get better as you use them.›Tri Dao (Chief Scientist at Together) · LLM score 20 · 4 months ago
- 231.@itsandrewgao Worst-of-N is a good solution to this.›Noam Brown (OpenAI Research Scientist) · LLM score 85 · 4 months ago
- 232.More info on some the cool capabilities of Genie 3 here: https://t.co/wcKztrUSu6Demis Hassabis (CEO of DeepMind) · LLM score 40 · 4 months ago
- 233.Fantastic to see Genie 3, our state-of-the-art world model, featured in @TIME's 2025 Best Inventions.›Demis Hassabis (CEO of DeepMind) · LLM score 40 · 4 months ago
- 234.@littmath I decided to check with a doctor friend and they told me “NSAIDs have a tendency to increase your risk of clotting (despite the fact that they affect the functionality of your platelets resulting in a tendency to bleed)”Noam Brown (OpenAI Research Scientist) · LLM score 40 · 4 months ago
- 235.@littmath Can you point out what specifically in the GPT-5 Thinking response you find problematic? Because so far you’ve not provided any evidence that its critique of “prevents blood clots” is unjustifiedNoam Brown (OpenAI Research Scientist) · LLM score 20 · 4 months ago
- 236.@littmath I'm confused why you're so confidently taking a strong position on this despite admitting that you're not a doctor.›Noam Brown (OpenAI Research Scientist) · LLM score 20 · 4 months ago
- 237.Are these just hallucinations by GPT-5 Thinking? Nope.›Noam Brown (OpenAI Research Scientist) · LLM score 40 · 4 months ago
- 238."One of our goals is to discover superconductors that work at higher temperatures than today's materials"›Noam Brown (OpenAI Research Scientist) · LLM score 70 · 4 months ago
- 239.Bullish, in the coming decades majority of compute will be spent on ai for science https://t.co/8aLDhazs6iJason Wei (AI Researcher at Meta) · LLM score 70 · 4 months ago
- 240.@j2bryson I know some AI startups that I think are way overpriced.›Noam Brown (OpenAI Research Scientist) · LLM score 30 · 4 months ago