The Real Python Podcast - Measuring Bias, Toxicity, and Truthfulness in LLMs With Python | Free Listening on Podbean App

Discover

Podcast Features
Your all-in-one podcasting solution.

Podcast Studio
Easy-to-use audio recorder app.
Livestream
High-performing audio live, without limits.

Podcast App
The best podcast player & podcast app.
Podbean AI
AI-Enhanced Audio Quality and Content Generation.

Ads Marketplace
Join Ads Marketplace to earn money
through sponsorship on your podcast.

PodAds
Manage your ads with dynamic ad insertion capability.
Patron & Paid Content
The seamless way for fans to support you directly
from your podcast.
Apple Podcasts Subscriptions Integration
Effortlessly publish and manage exclusive episodes for your
Apple Podcasts subscribers directly from Podbean.

All Arts Business Comedy Education
Fiction Government Health & Fitness History Kids & Family
Leisure Music News Religion & Spirituality Science
Society & Culture Sports Technology True Crime TV & Film
Live

How to Start a Podcast
How to Start a Live Podcast
How to Monetize a podcast
How to Promote Your Podcast
How to Use Group Recording

Log in
Start your podcast for free

Podcasting
Monetization
Enterprise
Pricing
Discover

The Real Python Podcast

Technology

Measuring Bias, Toxicity, and Truthfulness in LLMs With Python

2024-01-19

How can you measure the quality of a large language model? What tools can measure bias, toxicity, and truthfulness levels in a model using Python? This week on the show, Jodie Burchell, developer advocate for data science at...

How can you measure the quality of a large language model? What tools can measure bias, toxicity, and truthfulness levels in a model using Python? This week on the show, Jodie Burchell, developer advocate for data science at JetBrains, returns to discuss techniques and tools for evaluating LLMs With Python.

Jodie provides some background on large language models and how they can absorb vast amounts of information about the relationship between words using a type of neural network called a transformer. We discuss training datasets and the potential quality issues with crawling uncurated sources.

We dig into ways to measure levels of bias, toxicity, and hallucinations using Python. Jodie shares three benchmarking datasets and links to resources to get you started. We also discuss ways to augment models using agents or plugins, which can access search engine results or other authoritative sources.

This week’s episode is brought to you by Intel.

Course Spotlight: Learn Text Classification With Python and Keras

In this course, you’ll learn about Python text classification with Keras, working your way from a bag-of-words model with logistic regression to more advanced methods, such as convolutional neural networks. You’ll see how you can use pretrained word embeddings, and you’ll squeeze more performance out of your model through hyperparameter optimization.

Topics:

00:00:00 – Introduction
00:02:19 – Testing characteristics of LLMs with Python
00:04:18 – Background on LLMs
00:08:35 – Training of models
00:14:23 – Uncurated sources of training
00:16:12 – Safeguards and prompt engineering
00:21:19 – TruthfulQA and creating a more strict prompt
00:23:20 – Information that is out of date
00:26:07 – WinoBias for evaluating gender stereotypes
00:28:30 – BOLD dataset for evaluating bias
00:30:28 – Sponsor: Intel
00:31:18 – Using Hugging Face to start testing with Python
00:35:25 – Using the transformers package
00:37:34 – Using langchain for proprietary models
00:43:04 – Putting the tools together and evaluating
00:47:19 – Video Course Spotlight
00:48:29 – Assessing toxicity
00:50:21 – Measuring bias
00:54:40 – Checking the hallucination rate
00:56:22 – LLM leaderboards
00:58:17 – What helped ChatGPT leap forward?
01:06:01 – Improvements of what is being crawled
01:07:32 – Revisiting agents and RAG
01:11:03 – ChatGPT plugins and Wolfram-Alpha
01:13:06 – How can people follow your work online?
01:14:33 – Thanks and goodbye

Background Links:

A Beginner’s Guide to Language Models - Built In
ChatGPT - Explained! - YouTube

Dataset Links:

truthful_qa - Datasets at Hugging Face
wino_bias - Datasets at Hugging Face
bold - Datasets at Hugging Face

Tutorials and Documentation for Python Packages:

Evaluating Language Model Bias with 🤗 Evaluate
Hugging Face - HF_bias_evaluation - Google Colab
General Usage - Load a Dataset - Hugging Face
What is Text Generation? - Hugging Face
🤗 Evaluate - Library Evaluating ML Models
Python Quickstart - 🦜️🔗 Langchain

Measurement Links:

Toxicity - a Hugging Face Space by evaluate-measurement
Regard - a Hugging Face Space by evaluate-measurement
Open LLM Leaderboard - a Hugging Face Space

Training Data for LLMs:

Common Crawl - Open Repository of Web Crawl Data
The Pile
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora

Agents and Plugin Links:

Transformers Agents - Hugging Face
Agents - 🦜️🔗 Langchain
ChatGPT Gets Its “Wolfram Superpowers”! - Stephen Wolfram

Additional Links:

Inside the AI Factory: The Humans that Make Tech Seem Human - The Verge
Jodie Burchell - The JetBrains Blog
Jodie Burchell’s Blog - Standard error
Jodie Burchell 🇦🇺🇩🇪 (@t_redactyl) - Twitter
Jodie Burchell 🇦🇺🇩🇪 (@t_redactyl@fosstodon.org) - Fosstodon
JetBrains: Essential tools for software developers and teams

Level up your Python skills with our expert-led courses:

Data Cleaning With pandas and NumPy
Creating Web Maps From Your Data With Python Folium
Learn Text Classification With Python and Keras

Support the podcast & join our community of Pythonistas

More Episodes

Considering Accessibility & Assistive Tech as a Python Developer

Querying OpenStreetMaps via API & Lazy Evaluation in Python

Embarking on a Relaxed and Friendly Python Coding Journey

Pydantic Data Validation & Python Web Security Practices

Decoupling Systems to Get Closer to the Data

Avoiding Error Culture and Getting Help Inside Python

Leveraging Documents and Data to Create a Custom LLM Chatbot

Build a Video Game With Python Turtle & Visualize Data in Seaborn

Using Python in Bioinformatics and the Laboratory

Exploring Duck Typing in Python & Dynamics of Monkey Patching

Building a Healthy Developer Mindset While Learning Python

Automate Tasks With Python & Building a Small Search Engine

Wes McKinney on Improving the Data Stack & Composable Systems

Practical Python Decorator Uses & Avoiding datetime Pitfalls

Focusing on Data Science & Less on Engineering and Dependencies

Great Starting Points for Contributing to Open Source

Building a Python Debugger & Preparing for NumPy 2.0

Serializing Data With Python & Underscore Naming Conventions

Exploring Python in Excel

←
1
2
3
4
5
6
7
8
9
10
→

0123456789101112131415161819

Create your
podcast in
minutes

Full-featured podcast site
Unlimited storage and bandwidth
Comprehensive podcast stats
Distribute to Apple Podcasts, Spotify, and more
Make money with your podcast

It is Free

You may also like

Insight Story: Tech Trends Unpacked

Zero-Shot

Fast Forward by Tomorrow Unlocked: Tech past, tech future

The Unbelivable Truth - Series 1 - 26 including specials and pilot

Lex Fridman Podcast

Podcast Services
MONETIZATION & MORE
KNOWLEDGE BASE
Support
Podbean

Privacy Policy
Cookie Policy
Terms of Use
Consent Preferences
Copyright © 2015-2024 Podbean.com