Manuel González

I'm

About

Loved computers since I was a child, so I decided that engineering would fit my interests and aspirations. I’ve worked in different roles and companies, which I’ve learned a lot from. Ready for challenges, mainly related to data engineering.

Data Engineer

I've studied Computer Science at University of Extremadura in Spain. Currently, I'm not interested in working as a freelance, but it's not a definitive decision for the future.

  • Degree: Bachelor of Computer Science
  • Email: mannu3@hotmail.com
  • Freelance: Not available

My passions are sport, videogames, reading and listening to music. I think it is important to enjoy the spare time we have. I am also a fan of good communication, good vibes, good working environments, listening and empathy.

Click here to check my GitHub repository containing my projects.

Skills

These are my main skills and strengths which I am comfortable working with, although I am always open to know more tools to build software. You can summarize them into categories related to programming languages and frameworks in data ecosystem.

Python 90%
Machine Learning 80%
Azure Synapse 90%
Apache Spark 80%
Data Analysis80%
Azure Data Factory 80%
Azure DataBricks 90%
Talend Open Studio 75%
SQL90%
Power BI60%

Resume

Here you can check out all my professional experience, the courses and certificates I have completed properly. Each title in Courses and Certificates will lead you to its URL verification.

Sumary

Manuel González

Machine Learning and Data Engineer with 4+ years of experience in Software Engineering field and designing and developing models and ETLs

  • Extremadura, Spain
  • mannu3@hotmail.com

Education

Bachelor of Computer Science

2015 - 2020

University of Extremadura - Cáceres, Spain

Bachelor Degree Final Project related to machine learning techniques (classification and regression for supervised learning) with 10/10 mark

Courses and Certificates

Microsoft Certified: Azure Fundamentals (AZ-900)

July 2024

Microsoft

  • Describe cloud concepts
  • Describe Azure architecture and services
  • Describe Azure management and governance

Microsoft Certified: Azure AI Engineer Associate (AI-102)

July 2024

Microsoft

  • Plan and manage an Azure AI solution
  • Implement content moderation solutions
  • Implement computer vision solutions
  • Implement natural language processing solutions
  • Implement knowledge mining and document intelligence solutions
  • Implement generative AI solutions

Microsoft Certified: Azure AI Fundamentals (AI-900)

June 2024

Microsoft

  • Describe Artificial Intelligence workloads and considerations
  • Describe fundamental principles of machine learning on Azure
  • Describe features of computer vision workloads on Azure
  • Describe features of Natural Language Processing (NLP) workloads on Azure
  • Describe features of generative AI workloads on Azure

Microsoft Certified: Azure Data Engineer Associate (DP-203)

May 2024

Microsoft

  • Design and implement data storage
  • Develop data processing
  • Secure, monitor, and optimize data storage and data processing

Microsoft Certified: Azure Data Fundamentals (DP-900)

May 2024

Microsoft

  • Describe core data concepts
  • Identify considerations for relational data on Azure
  • Describe considerations for working with non-relational data on Azure
  • Describe an analytics workload on Azure

DeepLearning.AI TensorFlow Developer

June 2021

Coursera

  • Build natural language processing systems using TensorFlow
  • Handle real-world image data and explore strategies to prevent overfitting, including augmentation and dropout.
  • Apply RNNs, GRUs, and LSTMs as you train them using text repositories.

Deep Learning Specialization

September 2020

Coursera

  • Build and train deep neural networks, identify key architecture parameters, implement vectorized neural networks and deep learning to applications
  • Build a CNN and apply it to detection and recognition tasks, use neural style transfer to generate art, and apply algorithms to image and video data
  • Train test sets, analyze variance for DL applications, use standard techniques and optimization algorithms, and build neural networks in TensorFlow
  • Build and train RNNs, work with NLP and Word Embeddings, and use HuggingFace tokenizers and transformer models to perform NER and Question Answering

A gentle introduction to Machine Learning using Scikit-learn

March 2020

Udemy

  • Introduction to basic analysis with Python
  • Review of main algorithms used in Machine Learning
  • Review of main workflow for analyzing a dataset with scikit-learn

Professional Experience

Data Engineer

August 2023 - present

The Cocktail Experience

  • Analysis, design and implementation of data models and ETLs in the field of digital marketing, with the objective of carrying out an MVP consisting of a metrics dashboard from a Lakehouse (Data Lake + Data Warehouse) with Azure Synapse (Data Factory, Azure SQL Server and Azure Databricks)
  • Responsible for the migration of several processes related to a Pyspark technology telecommunications project (Spark, Python) to Google BigQuery.
  • Responsible for the migration of a logistics project to Talend Open Studio from Salesforce
  • SCRUM and Agile methodologies.

Data Engineer

August 2021 - August 2023

INNOCV Solutions

  • Design, development and deployment of pipelines and ETLs through Cloud services with Microsoft Azure (Data Factory) for a logistics project, with the objective of extracting GS1 standard files from different SFTP data sources and REST APIs, taking the processed information to a Data Warehouse that I designed and managed with SQL Server, serving to power a Control Tower for the WMS Ditwuit
  • Application of machine learning techniques with Python and libraries related to data engineering (Pandas, NumPy, scikit-learn, TensorFlow and Spark's MLLib) to extract high-value information related to the detection of anomalies on fraud accounts in a banking project.
  • Development and deployment of ETLs with Talend Open Studio for another logistics project related to a migration to the Ditwuit WMS.
  • Development of ETLs in microservices using Apache Kafka, Spark/Azure Databricks in Python and Spring with Java in a banking sector project, processing streaming data related to the management of push notifications
  • SCRUM and Agile methodologies.

Machine Learning Engineer

September 2020 - June 2021

JDani DevOps/SRE/Cloud Freelance - University of Extremadura

  • Researcher as part of the Innovation and Talent Project (PIT) as developer, analyst, machine learning and data engineering roles in collaboration with the University of Extremadura
  • Developed ETL and anomaly detection systems using machine learning and deep learning for collecting system resource metrics (CPU, memory, disk usage, system calls).
  • Used unsupervised learning (DBScan, KMeans, Gaussian distributions, SVMOneClass...), LSTM / RNN neural networks and Natural Language Processing techniques.
  • Main technologies were Python, Scikit-Learn, NumPy, Pandas, Tensorflow, Linux, Docker, ElasticSearch stack and Metasploit suit.
  • Achieved a paper in XVI National Congress of Criptography for Spain, in the following link:
    Anomaly detection in container attacks of software by monitoring and system resource analysis

QA Consultant

January 2020 - August 2020

Optiva Media

  • QA Specialization for video devices and digital TV automatic testing.
  • Writing statistics and bug reports.
  • Network analysis in Wireshark in order to verify packet communication between app's client and server.
  • Design and execution of several test-plans through TestLink platform.
  • Delivered production versions and customer satisfaction achieved as well
  • SCRUM and Agile methodologies.

Front-End Developer Intern

March 2019 - May 2019

Homeria Open Solutions

  • Developed a web application about a board game, as intern during my bachelor's degree internship.
  • Mainly HTML, CSS, TypeScript, Angular 8 and Google Firebase as main back-end.
  • Product delivered to client successfully.
  • SCRUM and Agile methodologies.

My interests

These are the kind of things I like to do, and would like to find when looking for a company to work on. If you are interested, maybe we can get in touch; of course I am willing to learn new stuff!

ETL developing

Build, deploy and mantain extraction, transformation and load workflows in cloud environments with Data Factory and Spark

Data Modeling

Design and implement Data Warehouses, Data Lakes and Data LakeHouses as main data architectures

Machine Learning

I've worked with common ML tools such as scikit-learn and Tensorflow to build ML systems

Python Developing

To develop transformation processes in Spark, Apache Airflow or Talend (with Java)

Databases and SQL

I have experience working with SQL (MySQL, SQL Server, Google BigQuery...)

SCRUM and Agile methodologies

To deliver reliable code following scheduled sprints

Contact

The most common and easiest way to reach me is via LinkedIn. Alternatively you can contact me via email through the form below.

Location:

Extremadura, Spain - remote positions

Loading
Your message has been sent. Thank you!