1. 1.
    Trinity large is very sparse (400B-A13B, 256 experts w/ 4 active per token).
    Cameron Wolfe (Researcher at Netflix) · LLM score 85 · about 7 hours ago
  2. 2.
    We got Claude to teach open models how to write CUDA kernels.
    Ben Burtenshaw (Hugging Face Researcher) · LLM score 85 · about 24 hours ago
  3. 3.
    BIG new idea in interpretability called Patterning
    alphaXiv · LLM score 75 · 1 day ago
  5. 7.
    his work was mostly the genius of Ethan Shen.
    Tim Dettmers (Research Scientist at Ai2) · LLM score 70 · 2 days ago
  7. 12.
    Finally, we conduct an analysis of variance across SWE-Bench runs.
    Ethan Shen (Ai2 Researcher) · LLM score 95 · 2 days ago
  8. 15.
    Community-built open benchmarks work really well, e.g., Terminal-Bench, HLE, MMTEB.
    Niklas Muennighoff (AI Researcher at Stanford) · LLM score 80 · 2 days ago
  9. 16.
    Had to cut this one for space: 2019: AI can't create art—creativity is uniquely human
    Noam Brown (OpenAI Research Scientist) · LLM score 20 · 3 days ago
  10. 17.
    @0xabi96 It feels like I’m cheating.
    Andrej Karpathy (AI researcher) · LLM score 70 · 3 days ago
  11. 18.
    @ChiragLathiya The nearest neighbor really is some kind of a junior engineer.
    Andrej Karpathy (AI researcher) · LLM score 80 · 3 days ago
  12. 21.
    A few random notes from claude coding quite a bit last few weeks.
    Andrej Karpathy (AI researcher) · LLM score 20 · 3 days ago
  13. 22.
    1987: AI can't win at chess—planning is uniquely human
    Noam Brown (OpenAI Research Scientist) · LLM score 70 · 3 days ago
  14. 24.
    Success in the Information Age was about being able to answer questions.
    Jonathan Ross (TPU Creator) · LLM score 20 · 3 days ago
  15. 25.
    this is a blog post on claude + llama.cpp https://t.co/yej6WsNnQA
    Ben Burtenshaw (Hugging Face Researcher) · LLM score 20 · 3 days ago
  16. 30.