Reading Notes 2021 Sept - Oct
This is my fifth blog of this series (and second to last for this year), summarising the great posts Elise and I came across during our Friday and Sunday night reading sessions. Hope you enjoy the reading as well :)
Experimentation
- Online Experiments Tricks — Variance Reduction: Talks about common variance reduction methods in experimentations
- Improving Experimentation Efficiency at Netflix with Meta Analysis and Optimal Stopping: Two techniques Netflix use to improve their experimentation efficiency
- How to Reduce A\B Testing Duration using Surrogate Metrics:How to use the Surrogate Metrics to better estimate long-term impact
- Bayesian A/B Testing in 5 Minutes: Quick walkthrough of Bayesian A/B testing steps
- Bayesian A/B Testing - Part 0 - Introduction, Part I - Conversions, Part II - Revenue, Part III - Test Duration, Part IV - Choosing a Prior : More on the same topic
Machine Learning & Analytics
- Product Analytics: Engagement Model: How to build an engagement model to gain insights into user behavior and product development
- 5 Techniques to Work with Imbalanced Data in Machine Learning: Common techniques to handle imbalance data
- 7 Oversampling Techniques to Handle Imbalanced Data: Same topic as the above one, but dive deep into the various oversampling techniques
- A Zero Math Understanding of Bayesian Optimization: A very good analogy and explanation of Bayesian Optimization
- Advertiser Recommendation Systems at Pinterest: Talks about potential product opportunities using the insights generated from the machine learning model at Pinterest ads product – a very good example of how to create user-facing value from data science
- The Machine Learning Behind Delivering Relevant Ads: Also from the Pinterest ads team, but touches more on how the ads delivery algorithm is implemented
- Marketing Mix Modeling - Introduction to Marketing Mix Modeling in Python, An Upgraded Marketing Mix Modeling in Python: A series of two articles that explains the marketing mix modeling very clearly with great insights into how to combine marketing concepts into it
- Top 5 Time Series Analytics: General introduction on very useful time series analytics techniques
- Hate Black-box Models? Time to Change That With SHAP: A great overview of what is SHAP value and how it helps with model interpretability
- 10 Exciting Examples of Machine Learning Applications in Healthcare: Examples of how machine learning could be used to improve healthcare
- Why You’ll Regret Training ML Models: General considerations before you starting training your machine learning models
- Predict Customer Churn (the right way) using PyCaret: A detailed walkthrough on how to use PyCaret to quickly build a churn prediction model
- MOUSE Movement Modelling to Predict Online Fraud: A very interesting idea of tracking mouse movement data to detect fraud
Data Platform
- Automating Data Production At Scale - Part 1: How Airbnb designed a data protection platform
- Signs You Are Using Data Visualization Tools Wrong: Discussion on how you should use your data visualization tools within an organization
Others
- Emojis in Your Data: A fun reading on how emojis are stored in database
- Virtual Presentation Tips for Data Scientists: Great advice on data scientists’ presentations
- The 10 Best Data Visualizations of 2021: 10 insightful data visualizations the author selected from Reddit
- Key Metrics for Data Science Team Success: A great article on how to measure the success of the data science team as a leader
- Designing and evaluating metrics: Key principles on designed good metrics