Kaggle Kernel Curriculum
<Kaggle> 2022. 5. 31. 20:41
Binary classification : Tabular data
1st level. Titanic: Machine Learning from Disaster
- 타이타닉 튜토리얼 1 (Titanic Tutorial 1) - Exploratory data analysis, visualization, machine learning
- EDA To Prediction(DieTanic)
- Titanic Top 4% with ensemble modeling
- Introduction to Ensembling/Stacking in Python
2nd level. Porto Seguro’s Safe Driver Prediction
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BPorto%20seguro-safe%20driver%20prediction%5D
- Data Preparation & Exploration
- Interactive Porto Insights - A Plot.ly Tutorial
- XGBoost CV (LB .284)
- Porto Seguro Exploratory Analysis and Prediction
3rd level. Home Credit Default Risk
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BHome%20Credit%20Default%20Risk%5D
- Introduction: Home Credit Default Risk Competition
- Introduction to Manual Feature Engineering
- Stacking Test-Sklearn, XGBoost, CatBoost, LightGBM
- LightGBM 7th place solution
Multi-class classification : Tabular data
1st level. Costa Rican Household Poverty Level Prediction
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BCosta%20Rican%20Household%20Poverty%20Level%5D
Binary classification : Image classification
1st level. Statoil/C-CORE Iceberg Classifier Challenge
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BStatoil%2C%20C-CORE%20Iceberg%20Classifier%5D
- Keras Model for Beginners (0.210 on LB)+EDA+R&D
- Transfer Learning with VGG-16 CNN+AUG LB 0.1712
- Submarineering.EVEN BETTER PUBLIC SCORE until now.
- Keras+TF LB 0.18
Multi-class classification : Image classification
1st level. TensorFlow Speech Recognition Challenge
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BTensorFlow%20Speech%20Recognition%5D
- Speech representation and data exploration
- Light-Weight CNN LB 0.74
- WavCeption V1: a 1-D Inception approach (LB 0.76)
Regression : Tabular data
1st level. New York City Taxi Trip Duration
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BNew%20York%20City%20Taxi%20Trip%20Duration%5D
2nd level. Zillow Prize: Zillow’s Home Value Prediction (Zestimate)
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BZillow%E2%80%99s%20Home%20Value%20Prediction%5D
- Simple Exploration Notebook - Zillow Prize
- Simple XGBoost Starter (~0.0655)
- Zillow EDA On Missing Values & Multicollinearity
- XGBoost, LightGBM, and OLS and NN
Object segmentation : Deep learning
1st level. 2018 Data Science Bowl
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5B2018%20Data%20Science%20Bowl%5D
- Teaching notebook for total imaging newbies
- Keras U-Net starter - LB 0.277
- Nuclei Overview to Submission
Natural language processing : classification, regression
1st level. Spooky Author Identification
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BSpooky%20Author%20Identification%5D
- Spooky NLP and Topic Modelling tutorial
- Approaching (Almost) Any NLP Problem on Kaggle
- Simple Feature Engg Notebook - Spooky Author
2nd level. Mercari Price Suggestion Challenge
Full summary: https://9566.tistory.com/category/%3CKaggle%3E/%5BMercari%20Price%20Suggestion%5D
- Mercari Interactive EDA + Topic Modelling
- A simple nn solution with Keras (~0.48611 PL)
- Ridge (LB 0.41943)
- LGB and FM [18th Place - 0.40604]
3rd level. Toxic Comment Classification Challenge
- [For Beginners] Tackling Toxic Using Keras
- Stop the S@#$ - Toxic Comments EDA
- Logistic regression with words and char n-grams
- Classifying multi-label comments (0.9741 lb)
Other dataset : anomaly detection, visualization
1st level. Credit Card Fraud Detection
- In depth skewed data classif. (93% recall acc now)
- Anomaly Detection - Credit Card Fraud Analysis
- Semi-Supervised Anomaly Detection Survey
2nd level. Kaggle Machine Learning & Data Science Survey 2017