TorchScale: Transformers at Scale
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
OneFormer: One Transformer to Rule Universal Image Segmentation
Large Language Models Are Human-Level Prompt Engineers
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
On the Versatile Uses of Partial Distance Correlation in Deep Learning
SeaPearl: A Constraint Programming Solver guided by Reinforcement Learning
What Makes Convolutional Models Great on Long Sequence Modeling?
Amos: An Adam-style Optimizer with Adaptive Weight Decay towards Model-Oriented Scale
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second
Long Range Graph Benchmark
Taming Transformers for High-Resolution Image Synthesis
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
GLM-130B: An Open Bilingual Pre-trained Model
Elucidating the Design Space of Diffusion-Based Generative Models
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
DigiFace-1M: 1 Million Digital Face Images for Face Recognition
Human Motion Diffusion Model
TranAD: Deep Transformer Networks for Anomaly Detection in Multivariate Time Series Data
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models