Diogo Ramalho

https://www.linkedin.com/in/dramalhoeng

View the Project on GitHub diogormec/CV

πŸ‘¨β€πŸ’» Diogo Ramalho – Data Scientist | Data Analyst

I am a Data Scientist with a solid engineering foundation, holding a Master’s in Energy & Environment (University of Coimbra) and a Bachelor’s in Automotive Engineering (Polytechnic Institute of Leiria). My professional background combines more than a decade of experience in Engineering and Production Management with recent specialization in Data Science and Machine Learning.

Currently, I work as a Junior AI & ML Data Specialist, designing predictive models, automating complex data workflows, and creating business intelligence dashboards that support data-driven decision-making.


🌟 Highlights / Key Achievements


πŸ’Ό Current Role

Junior AI & ML Data Specialist (May 2025 – Present)


πŸŽ“ Education & Training


🧠 Technical Skills

Programming & Data: Python, R, SQL, PySpark, Pandas, NumPy
Visualization: Power BI, Tableau, Matplotlib, Seaborn
Machine Learning & AI: scikit-learn, XGBoost, LightGBM, TensorFlow, PyTorch, PCA, K-Means, Model Interpretability
Computer Vision: YOLOv8, OpenCV, Roboflow, Ultralytics
Databases: MySQL, PostgreSQL
Deployment & Tools: Flask, Railway, Jupyter Notebook, Google Colab, Git/GitHub, joblib
Cloud: AWS, Google Cloud


πŸ“ˆ Competitor Price Forecasting (LDSSA)

End-to-end ML pipeline for competitor price prediction using LightGBM, with comprehensive feature engineering, time-aware cross-validation, and deployment as a production-ready Flask API on Railway, fully integrated with a database for real-time predictions.
Technical Skills: Python, LightGBM, Feature Engineering, Flask, Railway, SQL, API Deployment, Time Series Modeling


πŸ’Ό Financial and Predictive Analysis of Personnel Expenses (CESAE)

Led a comprehensive HR analytics project, developing a Business Intelligence system for personnel cost analysis and forecasting. Generated actionable insights for budgeting, staff planning, and retention strategies.

Key Achievements:

Technical Skills: Python (Pandas, NumPy, Statsmodels), SQL, Power BI, Excel, Data Visualization, Predictive Modeling


πŸͺ Retail Sales Forecasting (EDIT)

Performed exploratory data analysis, clustering of stores using K-Means and PCA, and predictive modeling with XGBoost (~79% accuracy). Integrated results into an interactive Power BI dashboard for business decision support.
Technical Skills: Python, XGBoost, PCA, K-Means, Power BI, EDA, Feature Engineering


🧷 Zipper Object Detection with YOLOv8

Developed a computer vision model for detecting zippers in images, trained on a custom dataset managed via Roboflow. Used YOLOv8m, achieving mAP50 = 0.97 with high precision and recall.
Technical Skills: YOLOv8, Roboflow, Python, Ultralytics, OpenCV, Google Colab, Object Detection, Model Evaluation


🏍 MotoGP Data Automation & Reporting

Automated data extraction and processing from unstructured MotoGP PDFs using advanced Python and regex, creating interactive dashboards in Power BI and Tableau to explore rider and team performance.
Technical Skills: Python, Regex, PDFPlumber, Pandas, Power BI, Tableau, Data Wrangling, Data Visualization


πŸ”¬ Breast Cancer Diagnosis Using Machine Learning

Built a classifier to distinguish malignant vs. benign tumors using the Breast Cancer Wisconsin Diagnostic Dataset, evaluating multiple ML algorithms.
Technical Skills: Python, scikit-learn, Random Forest, Logistic Regression, SVM, Classification, Model Evaluation


πŸ“š Certifications


🌐 Explore More

πŸ”— Portfolio Website
πŸ”— LinkedIn
πŸ”— GitHub


🀝 Interested in collaborating?
Feel free to reach out for data science opportunities or project discussions!