World Model on Million-Length Video And Language With RingAttention
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Fractal Patterns May Unravel the Intelligence in Next-Token Prediction
Precise Zero-Shot Dense Retrieval without Relevance Labels
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Relevance-guided Supervision for OpenQA with ColBERT
PLAID: An Efficient Engine for Late Interaction Retrieval
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Corrective Retrieval Augmented Generation
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
A Comprehensive Survey on 3D Content Generation
OLMo: Accelerating the Science of Language Models
Who’s Harry Potter? Approximate Unlearning in LLMs
Parameter-Efficient Transfer Learning for NLP
A Survey on Transformers in Reinforcement Learning
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Matryoshka Representation Learning
How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
Babbage from The Economist
Cyber Security Headlines
Techmeme Ride Home
Cybersecurity Today
Software Engineering Daily