Transferring knowledge from task-agnostic pre-trained deep models to downstream tasks is an important topic in computer vision research. With the growth of computational capacity, we now have open-source vision-language pre-trained models trained at large scale in both model architecture and amount of data. In this study, we focus on transferring knowledge for video classification tasks. Conventional methods randomly initialize the linear classifier head for vision classification, but they leave the use of the text encoder for downstream visual recognition tasks unexplored. In this paper, we revisit the role of the linear classifier and replace it with different knowledge from the pre-trained model.
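A minimal sketch of the core idea, assuming CLIP-style encoders: rather than randomly initializing the classifier head, the text encoder's embeddings of the class names serve directly as the classifier weights, and logits are cosine similarities between video features and those embeddings. The random tensors below are stand-ins for the (frozen) encoder outputs; all names and shapes are illustrative, not the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
num_classes, dim, batch = 5, 512, 4

def normalize(x):
    # L2-normalize along the last axis so dot products are cosine similarities
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

# Stand-in for text_encoder("a video of a {class name}") per class;
# these embeddings replace the randomly initialized linear head.
text_embeds = normalize(rng.standard_normal((num_classes, dim)))

# Stand-in for video_encoder(frames), pooled over time into one feature per clip
video_feats = normalize(rng.standard_normal((batch, dim)))

# Cosine-similarity logits: the text embeddings act as classifier weights
logits = video_feats @ text_embeds.T
preds = logits.argmax(axis=-1)
print(logits.shape)
```

Because the "classifier" rows are semantic text embeddings rather than random vectors, the head starts out already aligned with the visual feature space of the pre-trained model.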
2022: Wenhao Wu, Zhun Sun, Wanli Ouyang
Ranked #1 on Action Recognition on ActivityNet
https://arxiv.org/pdf/2207.01297v3.pdf