Cecilia “Keeko” Villaveces
(706) 351-8798
Keeko.villaveces@gmail.com
GitHub: https://github.com/keekov
Education
Bachelor of Science, December 2018
Department of Mathematics, University of Georgia, Athens
Computational skills
Operating systems: Microsoft Windows, GNU/Linux
Programming languages: R, C, Java, Python, UNIX Shell (bash), Mathematica, LaTeX, SageMath, MATLAB
Data visualization: R ggplot2 and igraph, Python seaborn, matplotlib, Power BI
Other: Singularity, Docker, SQL, Advanced Excel, GNU Image Manipulation Program, Anaconda, Slurm, Torque, Globus, EasyBuild, Kdenlive
Cloud Computing: AWS Certified Cloud Practitioner and AI Practitioner, Google Cloud, Azure
Work Experience
October 2022 - January 2025 Guidehouse
Supervisor: Elena Maximova
Position: Tech Consultant
August 2019 - July 2022 Georgia Advanced Computational Resource Center
Supervisor: Gui Cormier
Position: Bioinformatics Specialist
● Provided academic consultation and direct technical assistance to users of the Sapelo2 supercomputer, teaching them how to use its resources. This required proficiency in shell scripting (bash), use of Slurm, and knowledge of supercomputing workflows.
● Installed and tested software for users, mostly using EasyBuild. Became most experienced with bioinformatics software and pipelines such as those used in gene sequencing (FastQC, ABySS, Canu, SPAdes, SSPACE, BEAST, AUGUSTUS, BLAST, Bowtie, HMMER, etc.).
● Installed software by creating conda environments or Docker/Singularity containers. Built containers and environments from scratch when versions from the author were unavailable. Provided technical advice to users who wanted to start using environments or containers.
● Taught monthly training sessions on coding languages (R and Python), as well as training sessions on using Linux and on using Sapelo2.
● Advised users on improving software efficiency through various forms of parallelization (OpenMP, OpenMPI, R multithreading, the R parallel package) and GPU computing (CUDA, OpenACC).
August 2019 - June 2020 Centers for Disease Control and Prevention, Global Immunization Division
Supervisor: Abhijeet Anand
Position: Data Manager (FTE contractor through Scimetrika)
● Conducted data analysis using the R programming language, which involved the acquisition, cleaning, storage, maintenance, analysis, visualization and interpretation of data.
● Worked closely with epidemiologists to create code suiting the needs of the team, whose members had a variety of coding backgrounds. Provided technical assistance and project consulting to others in the Polio Eradication Branch as well as the Infectious Diseases Department.
● Created easy-to-read, customizable, and reusable report templates for our division using various data visualization tools, for example using R Markdown to create a Shiny application, PDFs, and Excel sheets.
● Leveraged a background in math, statistics, chemistry, and data analysis to determine the best algorithms and software for modeling and analysis, and explained the pros and cons of each method so that colleagues without experience in higher math could make a more informed decision.
Projects
2019 - Present Textbook figures and editing
I co-authored Chapter 14 of the book Computational Methods and Data Analysis for Metabolomics (ISBN-13: 978-1071602386). This chapter is about using Python for data analysis. I also provided substantial editing of other chapters. My supplementary code for Chapter 14 can be found here:
https://github.com/keekov/Springer under “Chapter 14 - Python example code.ipynb”
This code uses data visualization to analyze mass spectrometry data from an experiment on cells infected with yellow fever.
2021
Designed a two-day training session to teach R to students and faculty at the University of Georgia. Topics included basic R syntax as well as parallelization.
Spring/Summer 2022
I am co-writing class material on statistical data analysis and visualization for a “coding for business majors” class taught at the University of Arkansas. Given a weekly topic, I write, test, run, debug, edit, and comment code with the professor of the class to get the code ready for the students to run in a virtual lab. Most of the coding is done in R and includes many figures made with base R and ggplot2.