MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
GeoCalib: Learning Single-image Calibration with Geometric Optimization
Artificial Immune System of Secure Face Recognition Against Adversarial Attacks
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
rerankers: A Lightweight Python Library to Unify Ranking Methods
Automated Design of Agentic Systems
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Sapiens: Foundation for Human Vision Models
OctFusion: Octree-based Diffusion Models for 3D Shape Generation
Writing in the Margins: Better Inference Pattern for Long Context Retrieval
Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
Babbage from The Economist
Cyber Security Headlines
The WAN Show
Software Engineering Daily
Risky Business