Get The Data Thread in your inbox here
In The News
“New guidelines on freedom and privacy protection signal that the Chinese state is open to dialogue about how it uses technology.”
AI & Machine Learning
Transfer learning is a technique that enables the transfer of knowledge learned from one dataset to another. I wanted to make transfer learning easy to use for text classification. Through this project, I was able to achieve 83% classification accuracy on the IMDB movie reviews dataset with only 500 labeled samples.
The artificial-intelligence industry runs on the invisible labor of humans working in isolated and often terrible conditions—and the model is spreading to more and more businesses.
How did you learn to talk?
Probably something like this: Your infant brain, a hotbed of neurological activity, picked up on your parents’ speech tones and facial expressions. You started to mimic their sounds, interpret their emotions and distinguish relatives from strangers.
Business Intelligence & Data Viz
For every decade, we’ve summarized the words that most frequently appear in NYT headlines, sizing each word relative to its disproportionate popularity. Selecting a word will allow you to see its popularity throughout the past century.
Power BI has added the ability to set the Data Category property on measures as well as columns in tables. This means it is now possible to have the output of a DAX measure displayed as an image in a Power BI report, and this in turn opens up a lot of new possibilities.
My attempt here is to provide a relative perspective on what each of these resources excels at. I’ve tried to cover most of the resources I found commonly recommended by data viz Twitter and the book-buying algorithms.
Training & Resources
Statistics for Data Science: Introduction to t-test and its Different Types (with Implementation in R)
Hypothesis testing is one of the most fascinating things we do as data scientists. No idea is off-limits at this stage of our project. I have personally seen so many insights coming out of hypothesis testing – insights most of us would have missed if not for this stage!
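For readers who want to see what a t-test computes before opening the article, here is a minimal sketch in Python rather than R. It uses Welch's two-sample variant, which is one of several types and not necessarily the one the article focuses on. The function name welch_t and the sample values are made up for illustration.

```python
import math
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t statistic, approximate degrees of freedom) for Welch's t-test.

    Hypothetical helper for illustration; it does not compute a p-value.
    """
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error of each group mean (sample variance / n).
    se2_a = variance(sample_a) / n_a
    se2_b = variance(sample_b) / n_b
    # Difference in means divided by its combined standard error.
    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1)
    )
    return t, df


# Made-up measurements for two groups (assumption, not real data).
group_a = [5.1, 4.9, 5.6, 5.8, 6.0]
group_b = [4.1, 4.4, 3.9, 4.6, 4.0]

t_stat, dof = welch_t(group_a, group_b)
print(f"t = {t_stat:.3f}, df = {dof:.1f}")
```

A large absolute t value relative to the degrees of freedom suggests the group means differ by more than chance alone would explain. Turning it into a p-value requires the t distribution, which a statistics package would normally supply.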
In this article, you learn how to get data for management accounting with the balance sheet and income statement in R. Furthermore, you learn how to prepare the balance sheet and income statement in R and how to export it to Excel…
Simple predictions are all cases of linear regression. We first observe the trend and then predict based on it, e.g. how hard you must brake depending on the distance of the car ahead of you. Not all situations follow a linear trend, though: the rise of bitcoin from 2015 to 2016 was linear, but in 2017 it suddenly became exponential.
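The difference between a linear and an exponential trend can be sketched in a few lines of Python. This is an illustrative example under stated assumptions, not the article's own method: the data are synthetic, fit_line is a hypothetical helper, and the exponential series is handled by fitting a straight line to the logarithm of the values.

```python
import math


def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Synthetic data (assumption): one linear series, one exponential series.
xs = [0, 1, 2, 3, 4, 5]
linear_ys = [2.0 + 3.0 * x for x in xs]
exp_ys = [2.0 * math.exp(0.8 * x) for x in xs]

# A straight line describes the linear series directly.
lin_intercept, lin_slope = fit_line(xs, linear_ys)

# For exponential growth, log(y) is linear in x, so fit the line to log(y).
log_intercept, log_slope = fit_line(xs, [math.log(y) for y in exp_ys])
growth_rate = log_slope
initial_value = math.exp(log_intercept)

print(f"linear: y = {lin_intercept:.2f} + {lin_slope:.2f} * x")
print(f"exponential: y = {initial_value:.2f} * exp({growth_rate:.2f} * x)")
```

Fitting a straight line directly to the exponential series would underestimate the later values badly, which is why the log transform is applied first.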
The Voter Turnout Database is the best resource for a wide array of statistics on voter turnout from around the world. It contains the most comprehensive global collection of voter turnout statistics from presidential and parliamentary elections since 1945.
Guesstimation is a book that unlocks the power of approximation–it’s popular mathematics rounded to the nearest power of ten! The ability to estimate is an important skill in daily life… Guesstimation enables anyone with basic math and science skills to estimate virtually anything–quickly–using plausible assumptions and elementary arithmetic.
About Me: Pedro A. Medina, E.E. lives at the intersection of advanced analytics, business intelligence, and social responsibility. He inspires a new breed of innovative Data Scientists at Haystack Data Solutions. Connect with him on LinkedIn: www.linkedin.com/in/pedroalexandermedina
Need Help with Your Data?
Do you need help with any of the following?
- Data Science or Data Visualization?
- Staff Augmentation?
- Managed Data Science Services?
- Data Strategy or Training?