Luigui Gallardo-Becerra

Computational Biologist and Software Engineer with 6+ years of experience building backend systems, automating complex workflows, and designing scalable data pipelines. Proficient in Python, C#, ASP.NET Core, and SQL, with a strong background in bioinformatics, HPC environments, and cloud-native development. Contributed to 14 peer-reviewed publications through machine learning models and large-scale multi-omics analysis.

Below are the links to my LinkedIn, GitHub, and ResearchGate profiles. You can also download my resume here.

Página con tutoriales de bioinformática en español.


Experience

Graduate Research Assistant (Data Science & Bioinformatics)

January 2019 - July 2025

Led and contributed to genomics and microbiome research projects by integrating molecular biology workflows with scalable computational pipelines and web tooling to support high-impact scientific findings.

  • Developed reproducible NGS analysis pipelines (Python, R, Bash, Snakemake, Nextflow) for metagenomics, metatranscriptomics, RNA-seq, and 16S datasets, reducing analysis time from days to hours.
  • Analyzed high-throughput sequencing datasets (>2 TB) using HPC and Linux systems, optimizing compute environments and improving workflow efficiency for 10+ research members.
  • Designed and implemented custom ETL and data-processing tools (Python, R, Bash, Perl) to solve research-specific challenges in microbiome and microbial genomics studies.
  • Used Git and GitHub for version control, documentation, and collaborative development in multidisciplinary teams.
  • Co-authored 14+ peer-reviewed publications in journals including Microbial Ecology, Scientific Reports, and Genes, contributing to experimental design, data analysis, and manuscript preparation.

Data Engineer

October 2016 - December 2018

Worked as an independent contractor supporting data sourcing and annotation projects for clients in technology and machine learning.

  • Collected, curated, and preprocessed structured and unstructured datasets for ML and analytics teams.
  • Prepared weekly deliverables and maintained documentation for project managers and clients.
  • Provided translation and linguistic annotation (Spanish–English) to support data quality requirements.

Software Engineer (Internship)

May 2016 - July 2016

Assisted in the development of a clinical record management web application.

  • Built REST APIs using ASP.NET Core (C#) and implemented frontend components with Angular, HTML, CSS, and JavaScript.

Data Scientist | Bioinformatician (Internship)

January 2016 - July 2016

Supported a microbiome research project analyzing bacterial community composition under different aquaculture conditions.

  • Processed and analyzed 16S rRNA sequencing datasets using Linux, HPC environments, Bash, and R.
  • Developed ETL scripts (Python, Bash, Perl) for preprocessing and feature extraction.
  • Generated visualizations and statistical analyses for publication and presentations.
  • Co-authored a peer-reviewed article published in Nature Scientific Reports.

Mathematics Teacher

January 2014 - December 2015
Student Coaching

Provided part-time mathematics instruction for high school and college-preparatory students.

  • Developed custom lesson plans and provided individual and group tutoring.
  • Designed weekly assessments to measure and improve student performance.

Research & Portfolio

Below is a selection of peer-reviewed publications I have co-authored. For each work, I briefly describe my main contributions and provide a link to the published article.

Applications and Pipelines

LittleVet – Veterinary Clinic Management Web Application

  • Developed using Python, Django, HTML, and CSS.
  • Designed a full-stack system for managing clients, pets, appointments, and clinic records.
  • Implemented secure authentication and structured data models for scalable workflow management.

DESeq2 Differential Expression Analysis Tool

  • Developed using R, Shiny, and RStudio.
  • Automates differential expression analysis from metadata and read-count matrices.
  • Generates publication-ready volcano plots and correlation visualizations (SVG format).

Pearson and Spearman Correlation Analysis Tool

  • Built using Shiny and R for interactive correlation analysis.
  • Accepts tabular input and computes Pearson or Spearman correlations with statistical significance.
  • Provides downloadable, publication-ready correlation plots in SVG format.

Nextflow Pipeline for 16S rRNA Profiling Analysis

  • Developed using Nextflow to automate reproducible 16S rRNA workflows.
  • Processes Illumina paired-end reads through QC, OTU clustering, taxonomy assignment, and diversity analyses.
  • Generates standardized outputs compatible with downstream visualization platforms.


Education

National Autonomous University of Mexico (UNAM)

Ph.D. in Computational Biology
Thesis Research
January 2019 – July 2025

National Autonomous University of Mexico (UNAM)

Master of Science in Computational Biology
Thesis Research
August 2016 – January 2019

University of Guadalajara (UDG)

Bachelor of Science in Molecular Biology
Thesis Research
August 2012 – January 2016

Skills

Software Engineer with a strong background in backend development, cloud-native solutions, and scalable data systems. Experienced in building web applications with ASP.NET Core, designing ETL pipelines, and automating complex workflows. Fast learner with excellent problem-solving abilities and strong adaptability across diverse technologies.

Programming Languages & Tools
  • C#, ASP.NET Core, Python, SQL, Bash, JavaScript, R, HTML/CSS
  • Architecture: Clean Architecture, REST APIs, ETL Pipelines
  • Linux, Docker, AWS, Git/GitHub, CI/CD workflows
  • Databases: SQL Server, MySQL, PostgreSQL, Dapper
Bioinformatics & Computational Biology
  • NGS workflows: RNA-seq, WGS, 16S, metagenomics, metatranscriptomics, viromics
  • Pipeline development with Snakemake, Nextflow, Bash, Python, and R
  • Genome & transcriptome assembly (Trinity, SPAdes) and annotation
  • Differential expression, taxonomic profiling, pathway analysis
  • Sequence alignment & QC: Bowtie2, HISAT2, BWA, samtools, fastp, Kraken2
  • ETL automation and custom tool development for multi-omics data
Data Engineering & Cloud
  • ETL pipeline design and automation (Python, Bash, Apache Spark)
  • HPC / SLURM, Nextflow, Snakemake workflow orchestration
  • AWS cloud infrastructure, containerization with Docker
Languages
  • English – Full professional proficiency
  • Spanish – Native

Interests

I enjoy outdoor activities such as hiking, camping, and mountain biking, which help me stay active and connected to nature. At home, I appreciate classic films and television series as a way to unwind. I also dedicate time to exploring new developments in informatics, biotechnology, and scientific research, continually expanding my knowledge in these fast-evolving fields.


Awards & Certifications

  • FullStack Academy – Software Development Bootcamp (2023)
  • Google Data Analytics Specialization (2022)
  • Data Scientist: Machine Learning Specialization – Codecademy (2021)
  • Maximum Cum Laude – Bachelor’s Degree (2017)
  • First Place – XVI State Contest of Physics Devices and Experiments (2011)
  • Honorable Mention – ExpoCiencias Nacional, Tlaxcala (Science Dissemination Project, 2010)
  • Second Place – XXIII Mexican Mathematics Olympiad, State Competition (2010)
  • Third Place – XIX University Mathematics Olympiad (2009)
  • Second Place – XXI University Mathematics Olympiad, State Competition (2009)