https://www.linkedin.com/in/dramalhoeng
I am a Data Scientist with a solid engineering foundation, holding a Masterβs in Energy & Environment (University of Coimbra) and a Bachelorβs in Automotive Engineering (Polytechnic Institute of Leiria). My professional background combines more than a decade of experience in Engineering and Production Management with recent specialization in Data Science and Machine Learning.
Currently, I work as a Junior AI & ML Data Specialist, designing predictive models, automating complex data workflows, and creating business intelligence dashboards that support data-driven decision-making.
Junior AI & ML Data Specialist (May 2025 β Present)
Lisbon Data Science Starters Academy (LDSSA) β Remote (~350h, Oct 2024 β Jul 2025)
Intensive 34-week part-time program focusing on practical Data Science applications. Covered Python, supervised learning (classification & regression), time series, NLP, recommender systems, and model evaluation. Participated in 6 collaborative hackathons and completed a final Capstone Project with API deployment and technical report.
Key skills: Python, Pandas, NumPy, scikit-learn, Jupyter Notebook, Git/GitHub, ML pipelines, feature engineering, hyperparameter tuning, data storytelling, API development (Flask/FastAPI).
Data Analyst Program β CESAE Digital β (1050h, Nov 2024 β Jul 2025)
Comprehensive data analysis training with applied business projects, including SQL, Python, R, Power BI, and ETL pipelines. Integrated 3-month internship at Crunch, applying skills to real-world data projects and delivering actionable insights. Offered position as Junior AI & ML Data Specialist upon completion.
Data Science & Business Analytics β EDIT. β Disruptive Digital Education (216h, Sep 2023 β Feb 2024)
Hands-on program covering Python, Machine Learning, PySpark, SQL, and Business Intelligence, culminating in a real-world project with a strategic dashboard.
Academic Degrees:
Programming & Data: Python, R, SQL, PySpark, Pandas, NumPy
Visualization: Power BI, Tableau, Matplotlib, Seaborn
Machine Learning & AI: scikit-learn, XGBoost, LightGBM, TensorFlow, PyTorch, PCA, K-Means, Model Interpretability
Computer Vision: YOLOv8, OpenCV, Roboflow, Ultralytics
Databases: MySQL, PostgreSQL
Deployment & Tools: Flask, Railway, Jupyter Notebook, Google Colab, Git/GitHub, joblib
Cloud: AWS, Google Cloud
End-to-end ML pipeline for competitor price prediction using LightGBM, with comprehensive feature engineering, time-aware cross-validation, and deployment as a production-ready Flask API on Railway, fully integrated with a database for real-time predictions.
Technical Skills: Python, LightGBM, Feature Engineering, Flask, Railway, SQL, API Deployment, Time Series Modeling
Led a comprehensive HR analytics project, developing a Business Intelligence system for personnel cost analysis and forecasting. Generated actionable insights for budgeting, staff planning, and retention strategies.
Key Achievements:
Technical Skills: Python (Pandas, NumPy, Statsmodels), SQL, Power BI, Excel, Data Visualization, Predictive Modeling
Performed exploratory data analysis, clustering of stores using K-Means and PCA, and predictive modeling with XGBoost (~79% accuracy). Integrated results into an interactive Power BI dashboard for business decision support.
Technical Skills: Python, XGBoost, PCA, K-Means, Power BI, EDA, Feature Engineering
Developed a computer vision model for detecting zippers in images, trained on a custom dataset managed via Roboflow. Used YOLOv8m, achieving mAP50 = 0.97 with high precision and recall.
Technical Skills: YOLOv8, Roboflow, Python, Ultralytics, OpenCV, Google Colab, Object Detection, Model Evaluation
Automated data extraction and processing from unstructured MotoGP PDFs using advanced Python and regex, creating interactive dashboards in Power BI and Tableau to explore rider and team performance.
Technical Skills: Python, Regex, PDFPlumber, Pandas, Power BI, Tableau, Data Wrangling, Data Visualization
Built a classifier to distinguish malignant vs. benign tumors using the Breast Cancer Wisconsin Diagnostic Dataset, evaluating multiple ML algorithms.
Technical Skills: Python, scikit-learn, Random Forest, Logistic Regression, SVM, Classification, Model Evaluation
π Portfolio Website
π LinkedIn
π GitHub
π€ Interested in collaborating?
Feel free to reach out for data science opportunities or project discussions!