Seven Common Causes of Data Leakage in Machine Learning
Key Steps in Data Preprocessing, Feature Engineering, and Train-test Splitting to Prevent Data Leakage When I was evaluating AI tools like ChatGPT, Claude, ...
Key Steps in Data Preprocessing, Feature Engineering, and Train-test Splitting to Prevent Data Leakage When I was evaluating AI tools like ChatGPT, Claude, ...
Swiss Public Transportation Usage I just came back from Switzerland. Two things that impressed me the most are the stunning scenery and the great public tra...
Monthly Job Openings Rate The visualization dataset this week also comes from the U.S. Bureau of Labor Statistics. It is the job openings rate report on the...
My Medium Articles! In the past two months, I continued writing data science and AI contents on Medium. I am super excited to have more than 1k followers no...
How AI can accelerate your ML projects from feature engineering to model training Context Welcome back to the third article of my series, ChatGPT vs. C...
Monthly Unemployed by Reason This week I was browsing the US Bureau of Labor Statistics website and found the latest report on monthly unmployment reasons. ...
Average Tech Salary in 2023 The dataset I am visualizing this week is the tech salary trends report from Dice. It offers valuable insights into the tech sal...
Five criteria to compare ChatGPT, Claude, and Gemini in tackling Exploratory Data Analysis Context Welcome back to the second installment of my series,...
US Income Inequality by Ethnicity For this week’s visualization, I recreated the chart from Pew Research Center, on their report on income inequality. It pr...
A step-by-step guide to creating a visualization discovery chatbot with OpenAI API, FAISS, and Streamlit Over the last six years, I’ve embarked on a jou...