A data scientist & activist. A feminist. A latina. A speaker.
Django, NLTK, Regex,Pandas, Matplotlib, Scikit Learn, Numpy, BeautifulSoup4, Selenium, Gensim [LDA, Word2vec, Glove]
tidyverse, caret, modelr, MASS, tidytext, stringr, lubriate, acs, Rsocrata, glm, Shiny, leaflet, survey
Redshift, Snowflake, Postgres
S3 buckets, Redshift, AWLS CLI, EC2
Worked with both supervised and unsupervised methods. Experience has mostly centered on Natural Language Processing using word embeddings and dimension reduction on text to understand and extract semantic information through Word2Vec and Glove. Also have experience in topic modeling with Latent Dirichlect Allocation. Other methods have centered around generalized linear models, random forest, KNN, PCA, etc. Implement feature importance,residuals analysis, measuring overfitting vs underfitting, sampling, cross validation, hyperparemeter tuning, ensembling models, bias vs variance tradeoff etc.
AFSCME is the nation's largest public service employee union. Improved an inhouse model on the likelihood of someone to join the union by 15% accuracy with 1/10 of the features and improved computation time from days to minutes. Working with vendor data like voter files from Target Smart and Catalist. Experience with Hustle, NGP VAN, and Action Network. Overseeing our model productions and polling analysis.
Worked in the Travel Operations / Customer Experience at a startup that provided a travel management solution for businesses. Worked as the data analyst to create dashboards monitoring our call center metrics, while understanding semantic customer needs through natural language processing, and making sure that our call center is efficiently staffed to meet service level agreements through forecasting.
Consulted for the website of National Geographic and their ecommerce online store. Utilized Domo as a business intelligence tool to create dashboards across website, social, and customer data. Deployed A/B testing for the ecommerce websites and managed digital campaigns across email, facebook, seo, sem and affiliate marketing.
Consulted on digital reporting, business developments, and provided data for media kits/ ad proposals for brands such as TLC, Animal Planet, Discovery Channel, Discovery Kids, and Home & Health.
Demoed code about Python's BeautifulSoup4 and Selenium for 3 different organizations.
Demoed code about Python's Package Pandas for Data Analysis
Demoed Python code on how to construct if else and while statements
Demoed Functions vs Methods (Object Oriented Programming)
Fundamentals of an R Shiny web app
Research Assistant January 2019-Present under the supervision of Dr.Boukouvalas
Textbook Award Recipient 2020