Yannic Kilcher

Scalable MatMul-free Language Modeling (Paper Explained) Yannic Kilcher 23,937 7 days ago
xLSTM: Extended Long Short-Term Memory Yannic Kilcher 33,130 1 month ago
[ML News] Chips, Robots, and Models Yannic Kilcher 27,964 2 months ago
TransformerFAM: Feedback attention is working memory Yannic Kilcher 36,011 2 months ago
[ML News] Devin exposed | NeurIPS track for high school students Yannic Kilcher 39,932 2 months ago
[ML News] Llama 3 changes the game Yannic Kilcher 46,513 2 months ago
Hugging Face got hacked Yannic Kilcher 30,760 2 months ago
Flow Matching for Generative Modeling (Paper Explained) Yannic Kilcher 41,283 3 months ago
No, Anthropic's Claude 3 is NOT sentient Yannic Kilcher 43,254 4 months ago
What a day in AI! (Sora, Gemini 1.5, V-JEPA, and lots of news) Yannic Kilcher 32,454 4 months ago
Gemini has a Diversity Problem Yannic Kilcher 53,158 4 months ago
Let's build the GPT Tokenizer Andrej Karpathy 548,515 4 months ago
[ML News] Elon sues OpenAI | Mistral Large | More Gemini Drama Yannic Kilcher 31,995 4 months ago
[ML News] Groq, Gemma, Sora, Gemini, and Air Canada's chatbot troubles Yannic Kilcher 40,394 4 months ago