Simon Mak | Integrating Science into Stats Models
#statistics #science #ai
It’s a common dictum that statisticians need to incorporate domain knowledge into their modeling and the interpretation of their results. But how deeply can scientific principles be embedded into statistical models? Prof. Simon Mak (Duke University) is pushing this idea to the limit by integrating fundamental physics, physiology, and biology into both the models and model inference. This includes Simon’s joint work with Profs. David Dunson and Ruda Zhang (also of Duke University).
Scientific reasoning AND stats. What more could we ask for?
Enjoy!
Watch it on:
YouTube: https://youtu.be/bUbZO7R4z40
Podbean: https://dataandsciencepodcast.podbean.com/e/simon-mak-integrating-science-into-stats-models/
00:00 - Coming up: Scientists & Statisticians
02:09 - Introduction - Integrating scientific knowledge into AI/ML
06:08 - How much domain knowledge is sufficient?
09:15 - Choosing which prior knowledge to integrate into a model
14:49 - Black box & gray box optimization
19:50 - Non-physics examples of integrating scientific theory into ML models
22:45 - Scientific principles & modeling at different scales
27:20 - Correlation is just one way of modeling linkage
36:37 - Conditional independence & different-fidelity experiments
39:40 - Innovation vs incorporation of known information in the model
42:52 - Aortic stenosis example
52:49 - Which mathematics can be used to represent scientific knowledge
57:09 - How to acquire scientific domain knowledge
1:02:45 - Complementary approaches to integrating science
1:06:48 - Gaussian process & integrating priors over functions
1:12:48 - A topic for statisticians and scientists to debate: science-based vs data-based learning
Simon Mak's Webpage: https://sites.google.com/view/simonmak/home
Keith O’Rourke | The Logic of Statistics
Jack Fitzsimons | Evil Models: Hiding Malware in Neural Networks
Scott Cunningham | Causal Inference (The Mixtape)
Eric Daza | Important Ideas in Causal Inference
Wenting Cheng & Weidong Zhang | Advances in Biotech/Biopharma
Ruda Zhang | Gaussian Process Subspace Regression
Ruda Zhang | Math-Science Duality
Martin Goodson | Practical Data Science & The UK’s AI Roadmap
Jack Fitzsimons | Data Security, Privacy, & Artificial Intelligence
Chris Tosh | The piranha problem in statistics
Chris Holmes | AI, Digital Health, & The Alan Turing Institute
Philosophy of Data Science | Deborah Mayo | Revolutions, Reforms, and Severe Testing in Statistical Thinking
Charlotte Deane | Bioinformatics, Deepmind’s AlphaFold 2, and Llamas
Eric Schwitzgebel | Consciousness, Zombies, & First Person Data | Philosophy of Data Science
Starting a Statistics Consultancy | Janet Wittes
Philosophy of Data Science | Jingyi Jessica Li | Advancing Statistical Genomics
Mine Çetinkaya-Rundel | Advancing Open Access Data Science Education
Jingyi Jessica Li | Statistical Hypothesis Testing vs Machine Learning Binary Classification
Gualtiero Piccinini | What Are First-Person Data? | Philosophy of Data Science