In this episode I don't talk about data. In fact, I talk about metadata.
While many machine learning models rely on certain amounts of data eg. text, images, audio and video, it has been proved how powerful is the signal carried by metadata, that is all data that is invisible to the end user.
Behind a tweet of 140 characters there are more than 140 fields of data that draw a much more detailed profile of the sender and the content she is producing... without ever considering the tweet itself.
References
You are your Metadata: Identification and Obfuscation of Social Media Users using Metadata Information https://www.ucl.ac.uk/~ucfamus/papers/icwsm18.pdf
True Machine Intelligence just like the human brain (Ep. 155)
Delivering unstoppable data with Streamr (Ep. 154)
MLOps: the good, the bad and the ugly (Ep. 153)
MLOps: what is and why it is important Part 2 (Ep. 152)
MLOps: what is and why it is important (Ep. 151)
Can I get paid for my data? With Mike Andi from Mytiki (Ep. 150)
Building high-growth data businesses with Lillian Pierson (Ep. 149)
Learning and training in AI times (Ep. 148)
You are the product [RB] (Ep. 147)
Polars: the fastest dataframe crate in Rust - with Ritchie Vink (Ep. 146)
Apache Arrow, Ballista and Big Data in Rust with Andy Grove (Ep. 145)
Pandas vs Rust (Ep. 144)
Concurrent is not parallel - Part 2 (Ep. 143)
Concurrent is not parallel - Part 1 (Ep. 142)
Backend technologies for machine learning in production (Ep. 141)
You are the product (Ep. 140)
How to reinvent banking and finance with data and technology (Ep. 139)
What's up with WhatsApp? (Ep. 138)
Is Rust flexible enough for a flexible data model? (Ep. 137)
Is Apple M1 good for machine learning? (Ep.136)
Create your
podcast in
minutes
It is Free
Insight Story: Tech Trends Unpacked
Zero-Shot
Fast Forward by Tomorrow Unlocked: Tech past, tech future
Black Wolf Feed (Chapo Premium Feed Bootleg)
Bannon`s War Room