MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Evolutionary Optimization of Model Merging Recipes
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models
Chronos: Learning the Language of Time Series
Linear Transformers with Learnable Kernel Functions are Better In-Context Models
SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
TripoSR: Fast 3D Object Reconstruction from a Single Image
Diffusion Model-Based Image Editing: A Survey
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation
Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
BitDelta: Your Fine-Tune May Only Be Worth One Bit
Join Podbean Ads Marketplace and connect with engaged listeners.
Advertise Today
Create your
podcast in
minutes
It is Free
Cyber Security Headlines
The WAN Show
Babbage from The Economist
Software Engineering Daily
Cybersecurity Today