Every machine learning model has to start with feature engineering. This is the process of combining input variables into a more meaningful signal for the problem that you are trying to solve. Many times this process can lead to duplicating code from previous projects, or introducing technical debt in the form of poorly maintained feature pipelines. In order to make the practice more manageable Soledad Galli created the feature-engine library. In this episode she explains how it has helped her and others build reusable transformations that can be applied in a composable manner with your scikit-learn projects. She also discusses the importance of understanding the data that you are working with and the domain in which your model will be used to ensure that you are selecting the right features.
AnnouncementsThe intro and outro music is from Requiem for a Fish The Freak Fandango Orchestra / CC BY-SA
Update Your Model's View Of The World In Real Time With Streaming Machine Learning Using River
Declarative Machine Learning For High Performance Deep Learning Models With Predibase
Build Better Machine Learning Models With Confidence By Adding Validation With Deepchecks
Build A Full Stack ML Powered App In An Afternoon With Baseten
Skip Straight To The Fun Part Of Your Project With PyScaffold
Add Configuration Best Practices To Your Application In An Afternoon With Dynaconf
Take A Tour Of The Hidden Language Of Hardware And How It Powers Your Code
Take Control Of Your Electrical Systems With The Open Source FlexMeasures Energy Management System
How And Why To Build Effective Teams As An Engineering Leader
Complete Your Hardware "Weekend Projects" In An Actual Weekend With Belay
Catching Up With Pyre, A Fast Type Checker For Python
Standardizing On Python For All Software Projects At Ascend.io
Exploring The Process And Practice Of Building Better Software Through Code Reviews
Ship With Confidence By Automating Quality Assurance
Remove Roadblocks And Let Your Developers Ship Faster With Self-Serve Infrastructure
The Benefits Of Python And Django For Going From Zero To MVP At Speed
Powering The Next Generation Of Application Architectures With Web Assembly And The Fermyon Platform
Gain A Deeper Understanding Of What Your Code Is Doing And Where It Spends Its Time With VizTracer
Stream Processing In Real Time And At Scale In Pure Python With Bytewax
Tetra: A Full Stack Web Framework That Doesn't Make You Write Everything Twice
Create your
podcast in
minutes
It is Free
Insight Story: Tech Trends Unpacked
Zero-Shot
Fast Forward by Tomorrow Unlocked: Tech past, tech future
The Unbelivable Truth - Series 1 - 26 including specials and pilot
Lex Fridman Podcast