Personal Info
Rodolfo Viana
São Paulo, Brazil
LinkedIn / Github / Email
Summary
Data Engineer with + years of experience in developing and maintaining ETL pipelines and defining end-to-end data architecture. Experienced in managing and evolving data warehouses across different cloud providers, ensuring data quality, governance, performance, and reliability. Proven leadership in multidisciplinary projects and initiatives, focused on value delivery and scalability. Data Science specialist from ESALQ-USP, currently pursuing a Master's degree in Computer Science at UNESP.
Professional Experience
⟶ Senior Data Engineer @ Farfetch
[August 2021 — present]
Responsibilities:
- Designs and implements ETL pipelines and automations to ensure data availability, integrity, and efficiency, supporting the operation of 200+ processes
- Builds applications such as web scrapers and parsers to ingest external data
- Conducts in-depth analyses to validate data consistency and reliability
- Produces ad hoc studies and reports
Key contributions:
- Led a data infrastructure optimization project that reduced processing time by 63%, increased performance (IOPS) by 88%, and generated 67% savings in monthly costs
- Mentored and trained more than 10 professionals, strengthening their technical skills
⟶ Lecturer @ Universidade de Marília
[January 2026 — present]
Responsibilities:
- Teaches "Autonomous Systems and Intelligent Agents" to undergraduate Artificial Intelligence students
- Teaches "Introduction to Artificial Intelligence" to students across all IT programs at the institution
- Plans and develops course materials, including lectures, assessments, and supporting resources
⟶ Lecturer @ IDP
[July 2021 — December 2025]
Responsibility:
- Taught "Introduction to Programming with Python" and "Web Scraping" in postgraduate programs, training 100+ professionals
⟶ Senior Data Scientist @ Rede Globo
[December 2018 — August 2021]
Responsibilities:
- Developed and implemented predictive models — including classification algorithms and neural networks — for business intelligence and content personalization
- Performed exploratory data analysis and statistical modeling to generate story ideas and inform editorial planning
- Led the design and implementation of ETL pipelines and large-scale web scraping processes to ingest external data for analysis
Key contribution:
- Conceived and implemented a system that delivered real-time SARS-CoV-2 statistics to 500+ journalists, expanding news coverage and increasing viewer engagement by 41%
Education
⟶ M.Sc. in Computer Science @ Ibilce-Unesp
2025 — in progress
- Ongoing research: Enhancing edge detection in U-Net architectures for medical image segmentation
- Advisor: Prof. Wallace Correa de Oliveira Casaca, PhD
⟶ MBA in Data Science and Analytics @ Esalq-USP
2022 — 2023
- Capstone project: Anomaly detection in São Paulo state deputies' expenses using K-Means Advisor: Prof. Ana Julia Righetto, PhD
Technical Skills
Cloud Platforms
Google Cloud Platform (GCP), AWS, Microsoft Azure
Data Engineering and Orchestration
Docker, dbt, Databricks, Airflow, Spark, Google Cloud Functions, AWS Lambda, Apache NiFi, Pentaho, Git, Terraform
Machine Learning and AI
PyTorch, TensorFlow, Statsmodels, PyCaret, scikit-learn
SQL and Databases
Google BigQuery, Amazon Redshift, Azure Synapse, Snowflake, Microsoft SQL Server, MySQL, PostgreSQL
Visualization
Looker, Google Data Studio, PowerBI, Tableau
Programming
Python, SQL, shell scripting
Languages
Portuguese (native), English (fluent)
Interests
Machine Learning and AI
Computer vision, Scientific Machine Learning, Deep Reinforcement Learning
Mathematics and Modeling
Bayesian statistics, Gaussian processes, differential-equation-based modeling
Computer Science
Data structures and algorithms, bio-inspired computing, software engineering for data products
Domains
AI in healthcare (medical imaging, computer-aided diagnosis), transparency and ethics in AI (accountability, bias, governance)