Arun Rathi
🌈 Hi there!! 👋 Glad to see you here.
💬 Ask me about Data Science, Machine Learning and Math.
⚡ Fun fact: Data is never clean
🌈 Hi there!! 👋 Glad to see you here.
Main role is built a machine learning model
- Worked on a Deep Learning model to detect the anomalies in a multivariate time series
data.
- Univariate time series data analysis using Prophet (Facebook's open source library),
StumPy.
- Worked on multiple research papers to built a deep learning model for anomaly
detection.
- Built a dashboard for time series dataset to display the trends and seasonality.
- Worked on a Java project using OOPs
Start my kaggle journey to learn from data science community(Kaggler's) and looking forward to participate in the Kaggle competition
Built a machine learning model to diagnose the cancer patient.
- Train multiple machine learning model i.e Naive Bayes, K-nearest Neighbors, Random Forest
and XGBoost algorithms.
- Found XGBoost - best performer model, used logloss, Precision and Recall as evaluation
metrics.
Used publicly available dataset from [NYC Yellow
Taxi](https://www1.nyc.gov/site/tlc/index.page)
- First Model used is the Moving Averages Model which uses the previous n values in order to
predict the next value
- Other models - Weighted Moving Averages, Exponential Weighted Moving Averages, EWMA -
perform well compared to others.
- Regression models - Linear Regression, Random Forest Regressor, XgBoost Regressor.
- XgBoost Regressor performs well in Regression models but there is slightly difference in
the MAPE value between EWMA and XgBoost.
- Learning - Simple model can perform well.
Built multiple machine learning model to find best performer model.
- Used Feature Embedding methods - Bag of Words (BoW), TF-IDF vectorizer
- Found XGBoost - best performer model
- Evaluation Metrics - AUC and F1-score
GPA: 7.98
GPA: 8.01