Pedro Águas Marques Resume

Summary

pedromar2010+interview@gmail.com

Master’s graduate in Computer Engineering from Instituto Superior Técnico, University of Lisbon. 4+ years in IT at companies such as Vodafone, contributing to data projects.

Started working at 16, full-stack developer at a startup for 6 months.

Freelance project at 17, built functional desktop movie theater.

Team programming contest at 18, showcasing collaboration.

Versatile roles at 19, non-profit work, student commission coordinator.

Summer intern at 20, architected Vodafone’s micro-services.

Data Scientist at 21, led deep learning recommendation system.

Currently, I automate pipelines in Scala.

Passionate about technology, I’m an avid learner and an active member of the AWS User Group Lisbon.

Beyond my tech interests, I excel in running, climbing, swimming, cycling, canoeing, and tennis.

Seeking new opportunities in Data/Software Engineering, ready to learn and excel.

Highlights

Certification in Scala
Python, Scala, SQL, Terraform, JavaScript, Java
AWS EMR, S3, ECS and many more
Spark, Airflow, Snowflake, Docker, Databricks, GitHub
D3.js, NodeJS, PyTorch, scikit-learn
MLflow, Tableau, Plotly
Multiple certifications from the DeepLearning.AI Deep Learning Specialization

Testimonials

Pedro Luis

Tech Lead at Carpe Data

Pedro Águas was one of the main points of contact for anything related to Airflow, and he heavily impacted automation and efficiency across products.

He always brought good ideas and genuinely cared about the job and his colleagues. Additionally, Pedro was incredibly deadline-conscious, ensuring that all tasks within his control were completed promptly and efficiently.

Pavel Calado

Director of Software Engineering at Gympass

Pedro is a dedicated worker and an excellent team member, not only promptly carrying out the needed tasks but also actively cooperating with the remaining team members. In addition, Pedro has shown himself to be resourceful and inventive. These characteristics, together with a firm dedication to his work, allowed him to implement and test all suggestions, while also making his own decisions and defending their outcomes. On a personal level, Pedro is a communicative person who gets along very well with all those he has worked with. From what I could observe, he is well respected by both his colleagues and his teachers.

Full letter of recommendation from Pavel

Kassem Hussein

Senior Data Scientist

During the time I worked with Pedro, he was involved in two projects: one for automating DAGs in a machine learning model and another for exploring new techniques for detecting anomalies. In both projects, Pedro performed with great excellence and minimal supervision.

Experience

Data Engineer II - https://www.tripadvisor.com/, PT

August 2024 - Present

Automated query processes, saving 103 hours monthly with a 1095% ROI.

Data Engineer - https://www.carpe.io/, PT

November 2022 - April 2024

Carpe Data leverages social media, online content, and other forms of alternative data to gain a deeper insight into the risk of insuring companies and individuals.

Automated multiple Spark data pipelines with over 50TB of data and over 70 steps using Airflow, which ran in parallel. This automation saved the company money, as manual queuing of each step on AWS was no longer necessary.
Architected and implemented a scalable 10TB Spark data pipeline orchestration system.
Built new data pipeline steps and respective integration and unit tests in Scala to integrate new data sources into a 40TB pipeline.

Airflow, Python, AWS EMR and GitHub.

Data Scientist - https://gympass.com/, PT

August 2021 - November 2022

Prototyped a geospatial deep learning recommendation system for Gympass using an unsupervised machine learning GNN architecture, aiming to replace the current, extremely limited third-party recommendation system.

Built, trained, and evaluated an unsupervised machine learning algorithm (Graph Neural Network) on AWS.
Tracked model experiment metrics using MLflow.
Extracted, cleaned, scaled, and encoded geospatial production data for the system with a PySpark data pipeline (NLP).
Analyzed data on a 170TB SQL PrestoDB database to facilitate data-driven decision-making.
Built and deployed a Docker image server.

Backend Developer - https://www.vodafone.com/, PT

July 2020 - September 2020

Built a production REST API for consumer promotions, serving about 270 requests per second on average, to replace the legacy version on a large-scale enterprise EDW. Stack: NodeJS, Express, and Oracle DB, with Jira for task management.

Software Engineer - https://forecastit.pt/, PT

April 2017 - September 2017

Modeled information systems with entity-relationship (ER) diagrams using Oracle SQL Developer Data Modeler for experimentation

Worked in a Large-Scale Scrum team

Two-week sprints
Sprint reviews with stakeholders at the end of the sprint
Retrospective after each sprint