Download - Revisiting Classifier: Transferring Vision-Language Models for Video Recognition | Podbean

Discover

Podcast Features
Monetization
Podbean App
- Podcast Studio
  Easy-to-use audio recorder app.
- Podcast App
  The best podcast player & podcast app.

Help and Support
Popular Topics

All Arts Business Comedy Education
Fiction Government Health & Fitness History Kids & Family
Leisure Music News Religion & Spirituality Science
Society & Culture Sports Technology True Crime TV & Film
Live

How to Start a Podcast
How to Start a Live Podcast
How to Monetize a podcast
How to Promote Your Podcast
How to Use Group Recording

Log in
Start your podcast for free

Podcasting
Advertisers
Enterprise
Pricing
Resources
- Help and Support
- Popular Topics
Discover

Log in

Sign up free

Papers Read on AI

News:Tech News

Revisiting Classifier: Transferring Vision-Language Models for Video Recognition

2022-12-21

Download

Transferring knowledge from task-agnostic pre-trained deep models for downstream tasks is an important topic in computer vision research. Along with the growth of computational capacity, we now have open-source vision-language pre-trained models in large scales of the model architecture and amount of data. In this study, we focus on transferring knowledge for video classiﬁcation tasks. Conventional methods randomly initialize the linear classiﬁer head for vision classiﬁcation, but they leave the usage of the text encoder for downstream visual recognition tasks undiscovered. In this paper, we revise the role of the linear classiﬁer and replace the classiﬁer with different knowledge from the pre-trained model. 2022: Wenhao Wu, Zhun Sun, Wanli Ouyang Ranked #1 on Action Recognition on ActivityNet https://arxiv.org/pdf/2207.01297v3.pdf

view more

More Episodes

ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases

2024-11-01

527

Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

2024-10-31

168

Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation

2024-10-30

122

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

2024-10-18

232

LightRAG: Simple and Fast Retrieval-Augmented Generation

2024-10-17

210

Aria: An Open Multimodal Native Mixture-of-Experts Model

2024-10-16

110

AgentKit: Structured LLM Reasoning with Dynamic Graphs

2024-10-15

139

PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

2024-10-14

115

Diffusion Models are Evolutionary Algorithms

2024-10-10

170

Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering

2024-10-09

113

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

2024-10-08

149

Internal Consistency and Self-Feedback in Large Language Models: A Survey

2024-10-07

118

On the Diagram of Thought

2024-10-02

143

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

2024-10-01

108

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

2024-09-30

110

On the limits of agency in agent-based models

2024-09-24

172

Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization

2024-09-23

109

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

2024-09-22

100

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

2024-09-21

136

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

2024-09-20

96

←
1
2
3
4
5
6
7
8
9
10
→

012345678910111213141516171819

Get this podcast on your
phone, FREE

Download Podbean app on App Store

Download Podbean app on Google Play

Create your
podcast in
minutes

Full-featured podcast site
Unlimited storage and bandwidth
Comprehensive podcast stats
Distribute to Apple Podcasts, Spotify, and more
Make money with your podcast

Get started

It is Free

Podcast Services
MONETIZATION & MORE
KNOWLEDGE BASE
Support
Podbean

Privacy Policy
Cookie Policy
Terms of Use
Consent Preferences
Copyright © 2015-2026 Podbean.com