Hymavathi Gummudala

(812) 974-2574 · hygumm@iu.edu

"Hello, I'm a data scientist who graduated recently in Data Science from Indiana University. My passion lies in crafting solutions amidst the complexities of big data. Seeking full time opportunities in data science roles with teams tackling intricate challenges, crafting impactful solutions, and fostering a culture of collaboration, support, and ambition.

Welcome to my portfolio, where data-driven insights meet innovative solutions!"


Experience

Statistical Analyst (Lab Assistant)

Indiana University Indianapolis
  • Supervisors: Sylvester Inkoom, Elie Salomon
    • Facilitated student comprehension of statistical methodologies, such as ANOVA, t-tests, chi-squared tests, classification, and regression analysis, through hands-on instruction and coding exercises.
    • Course: INFO-I 415
    September 2023 - August 2024

    Research Analyst

    Indiana University Indianapolis
  • Supervisor: Sunandan Chakraborty
  • Causal Analysis Project

    Link to Repository:

    • Collected and preprocessed data from diverse sources (e.g., Hugging Face, GitHub), converting unstructured text and JSON to CSV.
    • Executed a 70-30 data split, facilitating comprehensive analysis across 110,000 sentences.
    • Achieved 81% accuracy with an advanced sequential algorithm.
    • Conducted detailed analysis, identifying instances of prediction flips and critical words.
    • Executed part-of-speech tagging using spaCy to analyze and determine the percentage distribution of each grammatical category.
    • Researched and completed a literature survey covering weather-related mood changes, data integration, and biomedical literature.
  • Supervisors: Sunandan Chakraborty, Little Lee
  • Text Extraction Project

    Link to Repository:

    • Spearheaded extraction and analysis of legal Latin phrases and case data from Supreme Court of Indiana documents, using Python and spaCy.
    • Orchestrated comprehensive data preprocessing, converting PDF documents to text format and meticulously removing irrelevant information for streamlined analysis.
    • Executed exploratory data analysis (EDA) to uncover key insights, visualizing trends and patterns to inform strategic decision-making processes.
    • Produced detailed reports and visualizations, effectively communicating findings to stakeholders and guiding informed business strategies.
    August 2022 - May 2024

    Graduate Teaching Assistant

    Indiana University Indianapolis
  • Supervisor: Elie Salomon
    • Evaluated students' performance and offered feedback on their assignments, projects, and inquiries.
    • Reviewed class materials, assignments, and labs for a group of students.
    • Participated in weekly meetings with the professor to analyze students' progress.
    • Course: INFO-H 516
    September 2022 - August 2023

    Data Analyst

    Accenture
    • Collaborated on designing and maintaining KPI-focused logistics dashboards.
    • Conducted comprehensive data monitoring and quality audits, employing advanced visualization tools to ensure data integrity and operational accuracy.
    • Streamlined task management processes by implementing JIRA workflows.
    January 2021 - July 2022

    Associate Data Analyst

    Accenture
    • Gathered and merged data from diverse sources into a standardized format, optimizing it for subsequent analysis.
    • Engineered SQL data models, optimizing logistics data structures for enhanced analytical capabilities and system efficiency.
    • Processed and transformed large logistics datasets with over 100 columns from multiple sources.
    • Automated logistics data pipelines.
    January 2020 - December 2020

    Education

    Indiana University Indianapolis

    Master of Science
    Applied Data Science

    GPA: 3.94

    Coursework:
    • Deep Learning | Cloud Computing for Data Science
    • Database Management | Data Visualization
    • Informatics | Natural Language Processing
    • Data Analytics | Statistical Inference

    August 2022 - May 2024

    Indian Institute of Information Technology, Design and Manufacturing

    Bachelor of Technology
    Electronics and Communication Engineering

    GPA: 3.5

    Coursework:
    • Fundamentals of Computing | Data Structures and Algorithms
    • Business Analytics | Mathematics for Continuous Domain
    • Mathematics for Discrete Domain

    August 2015 - May 2019

    Skills

    Technologies I am familiar with:
    Workflow

    Projects

    English to Telugu Machine Translation

    Adapted the BERT Fused Model for parallel English-Telugu sentence pairs, achieving a BLEU Score of 26.75.

    Designed and implemented preprocessing steps, including tokenization and normalization, tailored for linguistic peculiarities in the dataset.

    Modified preprocessing scripts and normalization techniques, such as byte pair encoding, to suit Telugu language-specific characteristics.

    Neural Machine Translation | Byte Pair Encoding | Language-Specific Preprocessing | BLEU Score

    January 2024 - May 2024

    AI Persona Detector

    This open-source tool is designed to detect whether text content has been generated by AI algorithms or authored by humans.

    Deep Learning | Machine Learning

    March 2024 - March 2024

    Cloud-Based Generalized Data Analysis Platform

    Designed and implemented a cloud application on AWS services, built to scale with the number of users.

    PySpark | Python | AWS Glue | AWS Lambda | AWS S3

    August 2023 - December 2023

    Life Years Lost in the USA

    Created an interactive website visualizing U.S. suicide trends, enhancing data engagement.

    Plotly | Seaborn | Matplotlib | D3.js | HTML

    August 2023 - December 2023

    Cancer Management Database

    Designed and developed a database system utilizing SQL and data modeling techniques to store and analyze 20 years of cancer treatment and patient data from CDC WONDER.

    Excel | MySQL | Tableau

    January 2023 - May 2023

    Job Matching Using Neural Network Approach

    Built a resume-to-job-description matching algorithm using neural networks.

    PyTorch | Keras | TensorFlow | sklearn | NumPy | Transformers

    January 2023 - May 2023

    Prediction of Survival of Patients in the ICU

    Predicted the survival of ICU patients using demographic data, medical history, and electronic health records (EHR).

    Achieved 97% accuracy through diverse classification algorithms, optimized Random Forest for an 8% boost, and employed SMOTE to balance the dataset for improved model performance.

    Scikit-learn | XGBoost | Pandas | NumPy | Matplotlib

    August 2022 - December 2022

    Track A Tool

    Collaborated with a team of 5 to build a human activity recognition model using TensorFlow.

    Designed and built a BeagleBone Black-based device to track objects.

    Surveyed and collected dementia patient data from hospitals to understand their forgetting patterns.

    September 2017 - April 2018

    Awards & Certifications
