with experience in data exploration, cleaning, and modeling for strategic decision-making,
including machine learning solutions like diabetes risk prediction.
Expert in Power BI, SQL, Python, Excel, and Tableau, and in ETL processes and data analysis across on-premises and cloud environments.
Applied these skills to develop a diabetes prediction model using clinical data, achieving 82% accuracy through feature engineering and XGBoost algorithms.
Passionate about automation and generating valuable insights,
from building ETL pipelines to creating predictive models that support healthcare decisions.
annahico@gmail.com | +34 678567628 | Barcelona, Spain
Development of a binary classification model to predict high academic performance in Portuguese and Mathematics subjects using student profile data (UCI Student Performance dataset). The objective is to evaluate the influence of socio-demographic, family, and school-related factors on academic success.
Highly skilled, detail-oriented Data Analyst with a proven ability to transform raw data into actionable insights that drive strategic decision-making.
Specialized in data exploration, cleaning, and modeling (including predictive ML models) to identify patterns and trends that fuel business growth.
Proficient in Power BI, SQL, Python (Scikit-learn, Pandas), Excel, and Tableau, with experience building dynamic visualizations, developing data pipelines, and generating impactful reports.
Strong foundation in ETL processes and data management across on-premises and cloud environments (Azure, AWS, Google Cloud), including deployment of ML models.
Passionate about automation: optimizing workflows through scripting and building machine learning solutions such as a diabetes risk prediction model (82% accuracy with XGBoost).
Adept at communicating complex findings clearly and effectively, enabling stakeholders at all levels to make informed decisions.
Continuously expanding knowledge in machine learning (binary classification, feature engineering), artificial intelligence, and big data analytics, leveraging data as a strategic asset to solve complex problems and drive innovation.
Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn
MySQL, PostgreSQL, Query Optimization
JavaScript, R (Basic)
Exploratory Data Analysis (EDA), Data Wrangling
Data Cleaning, Transformation, Feature Engineering
Jupyter Notebooks, Google Colab
DAX, Power Query, Data Modeling
Dashboards, Storytelling
Excel (Pivot Tables, Advanced Charts), Matplotlib/Seaborn
PostgreSQL, MySQL, SQL Server
MongoDB, Firebase
Data Pipelines, Apache Airflow
Scikit-learn, TensorFlow, Keras, PyTorch
Regression, Classification, Clustering, NLP
Cross-validation, Hyperparameter Tuning
Git, GitHub, GitLab, Bitbucket
AWS (S3, EC2, Lambda), Google Cloud, Azure
Agile, SCRUM, Kanban, CI/CD Pipelines
The Bridge - 600h (2024-2025)
Jesuïtes FP (In progress)
CIFO La Violeta - 100h (2024)
Universitat Oberta de Catalunya (2023)
Universitat de Barcelona (2016-2021)