The Sexiest Job of the 21st CenturyKatie Tran
Analytics & Data Pro

George Mason University Alumni
Computational and Data Science - Intelligence Studies - Accounting
SQL, MySQL, Python, R, Tableau, Excel, ...
Baltimore, Maryland, USA

Projects

Data analytics in business helps identify areas to focus on, process information, and review past revenues/performance efficiently.


Resume

I'm an aspiring data analyst passionate about helping organizations turn data into actionable insights. From defining key metrics to capturing accurate data and presenting clear findings, I support informed decision-making and effective stakeholder communication. With experience in data analysis and system development, I'm ready to help you unlock the full potential of your data and extract insights and knowledge from vast amounts of data.

Intelligence Expose

  • Collect and evaluate data from diverse sources—including law enforcement databases, surveillance, intelligence networks, and GIS—to identify patterns and help prevent organized crime and terrorism.


Work With Me

I am your own personal data pro to uncover unvaluable insights to take your company to the next level.

© Copyright. Katie M Tran. (2025)

Grammy Project

Python, Tableau, Pandas, Plotly.express, 2 datasets, 3 dataframes, bounce_rate, sum(), concat(), join, visualization, merge(), drop(), ...
Exploratory Data Analysis, Data Visualization
In this project, I worked on real data from both websites owned by The Recording Academy, the non-profit organization behind the famous Grammy Awards. Ray Starck, the VP of Digital Strategy, decided to split the websites into grammy.com and recordingacademy.com to better serve the Recording Academy's various audience needs.
Task:
Examining the impact of splitting up the two websites, and analyzing the data for a better understanding of trends and audience behavior on both sites.

Result

-----------------------------------------------------------------------------------------------------------------

Intel Data Center Sustainable project

SQL, Tableau
Exploratory Data Analysis, Data visualization
Intel, the semiconductor manufacturing powerhouse, is planning on building a new data center. Energy availability and usage are some of the key considerations in deciding on a location of the data center. For example, which regions produce a surplus of energy, and are therefore more likely to provide energy at cheaper prices? Which regions rely more on renewable energy sources?
Task:
Co-designed with Intel's Sustainability Team, I wrote SQL queries that power the analysis and create visualizations that helped the Intel team select the best location for the new data center.
Result:

----------------------------------------------------------------------------------------------------------------

Amazon Product Review Analysis & Data Visualization Project

Python, Natural Language Processing (NLP), Web ScrapingData Amazon reviews data was collected from 2013 to 2017.Reviews are still important for marketing products on Amazon.com, its reviews specifically rate whether the product met the customer’s expectations.Preprocessing
The processing of Web Scraping:
query data directly from API to get the dataset in csv.file:
1. Inspected the XHR network section of the URL to get URL
2. Input data into Python to create a data format I want
DATA ANALYSIS METHOD IN PYTHON:
• Loaded dataset in csv file into Python
• Used nltk function to stop unnecessary words
• Performed lemmatization,
• Extracted sentences to words,
• Dictionary to look up words and their frequency,
• Built a topic model,
• Computed model and coherence score
• Judged how good the given topic model is,
• Coherence score: 0.41,
• Data visualization.
Analyzed Amazon product reviews in physical stores (Wholes Food) II.
Analyzed Walmart product reviews and online shipping policy III.
More Action:
Compared the net sales of both pre and post covid-19 era
The reviews were picked randomly and the corpus has nearly 1600 reviews of different customers. The dataset was 27 columns including 99% of Amazon product brand and 1% of Moshi.
Methods / Analysis:
Using Natural Language Processing (NLP) to analyze texts, allowing machines to understand how humans speak.
The results of this Natural Language Processing (NLP)
Steps to Practice NLP:
• Most popular reviews: blue devices, long-time battery tablets, headphones, sound devices.
• Positive reviews increased sales by 20%
• Walmart had bigger net sales than Amazon for everyday basics.
• Post Covid-19, people love to go to physical stores.
• Increased the volume of visits in Walmart.
• Amazon’s sale growth was slowing down.
Conclusion:
Positive reviews were strongly converting traffic to product listing and increasing traffic that comes from the search engine result page (SERP) on Amazon. Online sales around the world hit $2 trillion per year, a product that had just one review is 65% more likely to be purchased than a product that had none, according to Power Reviews.
The impact of reviews was immense in sale revenues. Reviews could be an important part of product listing optimization strategy, not only in converting traffic that came to product listing, but also in increasing or decreasing traffic that came from the search engine result page (SERP).
Result:

----------------------------------------------------------------------------------------------------------------

Hospital Database Management System MySQL queries Project

Hospital Database Management System MySQL queries ProjectThis project is hospital management system, which is being created by myself using MySQL and queries to retrieve the data for analyzing. The schema and the data were inserted into the database with many tables of data. The database to be developed will consist of the following tables: Nurse, Department, Physician, Patient, Room, Prescription, Appointment, Procedure, Trained-in, affiliated with, Medication, Stay, on-call, Undergoes. The goal is to get real-time experience dealing with the kinds of challenges healthcare workers can find in a real hospital, like keeping track of patients and appointments.The system is used strictly by employees of hospital only. There are several locations of hospital and many relational tables of data. It creates a systematic and standardized record of Patients, Doctors, and Rooms, which can be controlled only by the administrator. The system should be able to query information about room and patient assignments for each employee of a department.
Result:

Intelligence Expose

Intelligence Analysis Projects | technical skill, critically thinking, research analysis, analytical skills | 01/2022 – 05/2025Jonathan Luna case | An Intelligence Analysis Report | 2022
Critically thinking, evaluated, and synthesized complex information from multiple sources
Why these healthy people were suddenly dying in military | An Intelligence Analysis Report | 2022
Presented clear, concise, and actionable insights in a structured written report that highlights key findings, implications.
The death of Captain Hess | An Intelligence Analysis Report | 2023
Transformed raw data into meaningful intelligence through strong analytical and communication skills.
Clyde Conrad | Intelligence Failure Analysis report | 2025
o Examines and dissect instances where intelligence gathering and analysis processes failed.
o Identifies the root causes, contributing factors, and potential systemic issues to prevent future occurrences
o Requiring strong critical thinking, data analysis, and effective communication skills to present findings in a comprehensive report.

Resume

Katie M Tran
Severn, MD 21144, United States,
[email protected],
https://www.linkedin.com/in/katie-tran-647ba7308/,
https://github.com/Katiedtran25?tab=repositories
SummaryAnalytical Thinking
Project Management
Collaboration
Problem-Solving
Data Analysis
Data Extraction
Data Cleansing
Data Normalization
Data Preparation
AI/ML Frameworks
Prompt Engineering
Predictive Modeling
Natural Language Processing (NLP)
Dashboards
Executive Summaries
Communication Skills
Collaboration Tools
Cloud Platforms (AWS, Azure, GCP)
Apache Spark
Real-World Datasets
Python
Pandas
NumPy
TensorFlow
PyTorch
ChatGPT
Gemini
Work Experience
06/2025 – Present | AI/Data Science Intern | Victoria Solutions | Remote
• Analyzed real-world datasets and delivered insights using Python, Pandas, and NumPy.
• Built predictive models for churn forecasting and fraud detection using TensorFlow and PyTorch.
• Cleaned and prepared unstructured data for analysis in fast-paced, client-simulated environments.
• Integrated AI tools like ChatGPT and Gemini to automate reporting and enhance data interpretation.
• Applied prompt engineering strategies to optimize AI-driven workflows.
• Created dashboards and executive summaries to communicate findings to stakeholders.
• Gained exposure to cloud platforms (AWS, Azure, Google Cloud) and distributed tools like Apache Spark.
01/2025 – 05/2025 | SQL & Python trainee | The Global Career Accelerator | Remote
• Collaborated with global teams on data projects for top partners including Intel, Grammys, Uber, Spotify, and TikTok.
• Extracted, cleaned, and analyzed large datasets using SQL and Python to uncover strategic insights.
• Engineered interactive Tableau dashboards to visualize user behavior and key industry trends.
• Delivered accurate, data-driven results under tight deadlines through effective cross-functional teamwork.
02/2023 – 09/2023 | Product Manager | Self-directed | Severn, Maryland
• Analyzed contract data using QuickBooks, achieving a 70% reduction in surplus inventory while optimizing proposal terms.
• Spearheaded the renovation of two 5,000 sq. ft. smoke-damaged homes, elevating project value.
• Supervised three contractor teams, streamlining handling time by 20%, slashing project costs by 50%.
• Managed contracts, schedules, and cross-trade coordination to ensure consistent on-time delivery of urgent projects.
Extracurricular & Certificates
July 2025 | Wells Fargo Consumer, Small and Business Banking Job Simulation on Forage Remote
• Completed a job simulation focused on analyzing customer trends and creating strategic solutions for Wells Fargo’s Consumer, Small & Business Banking (CSBB) team.
• Utilized financial data to align customer needs with appropriate business divisions and tailored products and services.
• Developed strategic recommendations to enhance customer satisfaction and drive business outcomes, showcasing skills in financial analysis, decision-making, and professional communication.
Projects
March 2025 | Sustainability Intel Center Project | SQL & Tableau |The Global Career Accelerator
• Analyzed sustainability-related business data to uncover insights and opportunities.
• Evaluated Intel datasets to identify patterns, trends, and anomalies in national security data.
• Utilized Python and SQL to clean and process large datasets, improving efficiency by 30%.
• Created interactive Tableau dashboards to visualize key insights, aiding strategic decision-making.
• Delivered data-driven recommendations aligned with Intel’s corporate sustainability goals.
• Strengthened skills in real-world consulting, sustainability metrics, and impact measurement.
April 2025 | Grammys Website Audience Analysis Project | SQL & Tableau | The Global Career Accelerator
• Scraped and analyzed historical Grammy Awards data to determine trends in winning artists and genres.
• Utilized SQL to predict future winners with 85% accuracy.
• Designed an interactive Tableau dashboard for data exploration.
• Investigated key questions around content performance and audience engagement using real data.
• Produced data visualizations and reports that informed strategy for digital content, and audience retention.
• Presented findings in a professional setting, applying business communication and storytelling techniques.
Summer 2024 | The Movie Database API Analytic Project | Python, TMDB API Analytic tools| Self-Project
• Identified and presented four top-rated movies based on combined criteria such as rating, votes, and release date.
• Integrated TMDB API to extract and analyze real-time movie data using Python.
• Forecasted usage trends and detected anomalies to anticipate user engagement issues.
• Parsed JSON responses to uncover patterns in ratings, genres, and popularity.
• Identified and presented top-rated films based on custom ranking metrics.
Spring 2024 | Hospital Database Project | MySQL, LaTeX | George Mason University (GMU)
• Engineered a normalized hospital database in MySQL, including ER diagram design, schema creation, and data population.
• Developed and optimized SQL queries, stored procedures, and indexing strategies to improve query performance.
• Modeled complex patient-doctor-treatment relationships to ensure referential integrity and efficient data access.
• Simulated real-world hospital workflows to identify and resolve bottlenecks in patient tracking and scheduling.
• Documented technical implementation and data insights using LaTeX for structured reporting.
Fall 2022 | Amazon Product Review Analysis & Natural Language Processing (NLP) Modeling Project | NLP & Logistic regression | GMU
• Applied machine learning and NLP techniques to analyze sentiment and extract insights from Amazon product reviews.
• Built a sentiment classification model using logistic regression and TF-IDF vectorization to evaluate customer opinions.
• Analyzed Amazon product reviews using Natural Language Processing techniques to extract customer sentiment and behavioral patterns.
• Transformed text data using TF-IDF vectorization to quantify review content for machine learning analysis.
• Identified correlations between review sentiment and product sales performance to inform marketing and inventory decisions.
• Conducted exploratory data analysis and visualized review trends using Python libraries such as Matplotlib and Seaborn.
• Compared 2022 revenue trends of Amazon and Walmart to contextualize product performance within broader retail competition.
Fall 2022 | Northern Virginia Flying Squirrel Habitat Modeling Project| ArcGIS & GIS Analysis| Northern Virginia Community College
• Developed a GIS-based habitat suitability model for the Northern Virginia flying squirrel using micro- and macrohabitat affiliation data across West Virginia.
• Standardized and weighted spatial variables, applying Map Algebra and the Reclassify tool in ArcMap to manipulate and analyze geospatial data.
• Produced a predictive habitat model highlighting elevation and forest cover as key indicators, demonstrating strong accuracy across mountainous regions in West Virginia.
Education
01/2019 – 05/2025 | George Mason University | Fairfax, USA
Bachelor's, Computational and Data Science | Intelligence Studies
• Dean’s list - Spring & Fall 2024
01/2023 – 12/2024 | Northern Virginia Community College| Annandale, USA
Certificate, Accounting (24 academy credits)
• Dean’s list - Spring 2018, Presidential Scholar - May 2018, Honor student with GPA 4.0
• Eligible for CPA exam (150 academic credits)
09/2006 – 05/2010 |University of Finance and Marketing | Vietnam
• Bachelor's, Business Administration
SkillsSQL
Python
R
C
Matlab
SAS
Natural Language Processing
Ubuntu
Linux
Jupyter Notebook
Tensorflow
Scikit-learn
VSCode
Google CoLab
Python PyCharm
Python Spyder
Python Anaconda
ArcGIS
NetLogo
StatCrunch
JMP Pro
MyStatLab
Docker
Tableau
Excel
Access
Power Point
Word
Computational and Data Science Courses:Scientific Data Mining & Visualization
Intro To Computer Science
Computational Science Tools
Computing for Scientists
Intro Social Network Analysis
Modeling and Simulation I & II
Scientific Information/Data
Intro Scientific Programming
Scientific Data and Databases
Geographical Information Systems I
Python Programming
Intro to Computing

About me

Email: [email protected]