Resume
Basics
| Name | Gabriel Zhang |
| yuanzzhang3@gmail.com |
Education
-
Sep 2023 - Dec 2024 Evanston, IL
Master's degree
Northwestern University
Machine Learning and Data Science
- Foundation Models
- Natural Language Processing (NLP)
- Deep Learning
- Predictive Analytics
- Cloud Engineering
- Data Mining
-
Aug 2019 - Jun 2023 Irvine, CA
Bachelor's degree
University of California, Irvine
Data Science
- Machine Learning
- Big Data Analytics
- Probability and Statistics
- Algorithms
- Information Retrieval
Work
-
Jul 2024 - Sep 2024 Waltham, MA
Data Science Engineer Intern
SS&C Intralinks
- Engineered a language-identification POC to boost OCR accuracy for multilingual text in images
- Evaluated 30 vision-language models based on accuracy, processing latency, and memory efficiency on AWS EC2
- Built efficient Python pipelines for text-rich image detection and CLIP-based language identification, achieving 99% top-1 accuracy at 0.15s/document latency
- Projected $300K annual cost savings by reducing dependency on third-party OCR APIs
-
Jul 2021 - Aug 2021 Shanghai, China
Data Analyst Intern
Shanghai Daiqian Information Technology Co., Ltd.
- Developed a feedback tracking system to streamline feedback collection with automated reminders using YiDA (Alibaba Cloud), reducing collection time by 65% and expanding testing capacity by 50%
- Conducted customer segmentation and product performance analysis to enhance customer experience, designing 20+ data visualizations using R and ggplot2
Projects
Awards
- 2023
Cum Laude
University of California, Irvine
- 2023
Skills
| Machine Learning & AI | |
| LLM [Hugging Face Transformers, LangChain, OpenAI API] | |
| NLP [NLTK, Sentence Transformers, BERTopic, SpaCy] | |
| DL/ML [PyTorch, TensorFlow, Keras, Scikit-learn, XGBoost, Statsmodels, MLflow] | |
| Core [Pandas, NumPy, OpenCV] |
| Data Platforms | |
| Spark | |
| Databricks | |
| Hadoop | |
| Pinecone | |
| BigQuery | |
| MongoDB | |
| Neo4j | |
| Cassandra | |
| PostgreSQL |
| AWS Services | |
| SageMaker | |
| EC2 | |
| ECS | |
| S3 | |
| RDS | |
| Lambda | |
| SQS |
| MLOps | |
| Docker | |
| Git | |
| GitHub Actions | |
| CI/CD | |
| Linux | |
| Unit Testing | |
| Flask | |
| REST API |
| Data Visualization | |
| Tableau | |
| Excel | |
| Seaborn | |
| Matplotlib | |
| ggplot2 |
Volunteer
-
Jun 2022 - Oct 2023 Software Developer
Irvine Canaan Christian Community Church
Architected a secure children's check-in/out system via barcode verification