On the limits of agency in agent-based models
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
GeoCalib: Learning Single-image Calibration with Geometric Optimization
Artificial Immune System of Secure Face Recognition Against Adversarial Attacks
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
rerankers: A Lightweight Python Library to Unify Ranking Methods
Automated Design of Agentic Systems
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Sapiens: Foundation for Human Vision Models
OctFusion: Octree-based Diffusion Models for 3D Shape Generation
Writing in the Margins: Better Inference Pattern for Long Context Retrieval
Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
The WAN Show
Cyber Security Headlines
Babbage from The Economist
Cybersecurity Today
Software Engineering Daily