My_Portfolio

Home page Resume Portfolio Projects Study Projects Certificates Contacts

Vikash Kumar Singh

About me

Hello, I’m Vikash Kumar Singh. I recently graduated with a Master of Information Systems Management from Carnegie Mellon University, where my academic endeavors were grounded in a broad spectrum of coursework, including Database Management, Data-Focused Python, Statistics and Probability, A/B Testing, Telling Stories with Data, and more.

I enhanced my practical experience as a Data Quality Analyst at Sustainible, where I managed extensive datasets to aid entrepreneurs in refining their business models. This role, along with my teaching assistantship in Database Management, underscores my passion for applying analytical acumen and technical expertise to solve real-world problems and drive innovation. My professional path has taken me from Infosys Limited, where I contributed significantly as a Quality Analyst and Engineer, to impactful academic projects at Carnegie Mellon University. These projects have not only honed my technical skills in Python, Java, and data analytics but also cultivated my expertise in data visualization, cloud-based solutions, and AI-driven technologies.

I thrive in roles that challenge me to leverage my analytical skills and technical knowledge to solve complex problems and drive innovation. I’m keen to connect with fellow tech enthusiasts, professionals, and innovators who are passionate about using technology to drive positive change. Let’s explore how we can collaborate to create impactful solutions.

For more information please check my Resume.

This repository was created to showcase my analytical and technical skills (Excel, Python, SQL, Power BI, Tableau, and others).

Contents

Portfolio Projects

This section contains a list of projects with brief descriptions.

Analyzing COVID RNA Sequences

Description: In this project, I delve into the RNA sequences of COVID, focusing on two significant variants: Delta and Omicron. RNA, a vital nucleic acid, serves as the genetic blueprint for COVID, facilitating its cellular entry and replication. By leveraging data from the National Institutes of Health (NIH), I dissect the metadata for each COVID RNA sequence to unravel insights into these variants.
Code: covid_genome
Original dataset: ncbi_datasets
Skills: analytical thinking, data cleaning, data analysis, data vizualization, presentations
Hard skills: MS PowerPoint, Python: Pandas, NumPy, Mathplotlib, Seaborn.
Results: Identified exact mutation points by finding alignments and mismatches between any two RNA sequences. Color coding resulted in visually differentiating alignments from mismatches, such as insertions, deletions, and substitutions, making it easier to interpret and analyze genetic variations.

Finding Heavy Traffic Indicators on I-94

Description: In this analysis, we’ll examine data related to traffic heading west on the I-94 Interstate. Our objective is to identify several factors that contribute to congestion on I-94. Potential factors include weather conditions, time of day, and day of the week, among others.
Code: I-94 Traffic
Original dataset: Metro_Interstate_Traffic_Volume
Skills: analytical thinking, data cleaning, data analysis, data vizualization.
Hard skills: Excel, Pivot Tables, Formulas, Functions, Charts, Dashboards, Slices, Pivot Charts.
Results: In this project, my aim was to identify indicators of heavy traffic on the I-94 Interstate highway. Through the analysis, I identified two main types of indicators:

Description: As part of the project, I utilized Python to extract and clean financial data from the Nasdaq API, conducted trend analysis and comparative studies on metrics such as Accrued Expenses Turnover, and employed Matplotlib to create visualizations for effective presentation of findings, enabling a comprehensive exploration of financial data.
Code: Exploring Financial Data
Original dataset: nasdaq_data
Skills: analytical thinking, data cleaning, data analysis, data vizualization
Hard skills: Python, Pandas, SQL, Excel
Results:

Analyzing Startup Fundraising Deals from Crunchbase

Description: As part of my project, I undertook an in-depth analysis of startup fundraising deals sourced from Crunchbase.com. Leveraging the techniques acquired in pandas, I thoroughly explored the dataset to unravel trends, patterns, and noteworthy observations within the realm of startup financing. This endeavor not only honed my skills in data analysis but also provided valuable insights into the dynamics of fundraising rounds in the startup ecosystem.
Code: crunchbase
Original dataset: crunchbase_investments
Skills: exploratory analysis, analytical thinking, , data vizualization
Hard skills: data cleaning, data analysis, Python, Pandas, SQL, Excel, Dashboards
Results: I gained a comprehensive understanding of the startup investments dataset. This exploration paved the way for insightful analysis, where I extracted valuable insights into fundraising rounds and identified notable trends in startup financing.

Study Projects

Telling Stories With Data

Website: Telling Stories With Data
Description: Welcome to my “Telling Stories With Data” portfolio from Carnegie Mellon University, where I transform complex datasets into clear, engaging narratives through innovative visualizations. My work, including a comprehensive final project, demonstrates my ability to reveal insightful stories hidden within data. If my approach resonates with you, I’d be delighted to explore how my skills can contribute to your team.
Skills: Python, Pandas, NumPy, Matplotlib, Tableau, Data Analysis, Data Visualization
Status: Completed in 2024

Window Functions SQL Analytics for Northwind Traders

Description: Suppose, I am a Data Analyst at Northwind Traders, an international gourmet food distributor. Management is looking to me for insights to make strategic decisions in several aspects of the business. The projects focus on:

Evaluating employee performance to boost productivity, Understanding product sales and category performance to optimize inventory and marketing strategies, Analyzing sales growth to identify trends, monitor company progress, and make more accurate forecasts, And evaluating customer purchase behavior to target high-value customers with promotional incentives.

Using the PostgreSQL window functions on the Northwind database, I will provide these essential insights to management, contributing significantly to the company’s strategic decisions.
Code: Northwind
Original dataset: Northwind Database
Skills: Advanced SQL and Database Management, Analytical and Critical Thinking, Data Manipulation and Analysis , Performance Metrics and KPI Development, Sales and Marketing Analytics, Trend Identification and Forecasting, Strategic Decision-Making

Amazon Stock Market Analysis

Description: In the business world, there are few places that generate more daily data than the stock market. Analysts can use this data to explore and explain the past as well as provide insights about the future. But where does one start? Using data visualizations to communicate knowledge and information to make wise data-driven decisions is a valuable skill for any business professional.

In today’s business landscape, the stock market churns out massive data daily. My project focuses on mastering data visualization to dissect this information effectively. By creating concise visual representations, I aim to uncover valuable insights for informed decision-making in the dynamic world of finance.
Report: Final Report
Original dataset: Amazon Stock Market Data
Skills: Excel, Analytical and Critical Thinking, Data Manipulation and Analysis , Data Visualization, Marketing Analytics, Trend Identification

Identifying Customers Likely to Churn for a Telecommunications Provider

Description: In this project, I will:

• Explore the different types of descriptive statistics.

• Learn how to identify the appropriate statistic to use in various scenarios.

• Apply these statistics to analyze real-world data.

• Enhance and support the analysis using suitable visualizations.

Specifically, I will work with data from a telecommunications provider to identify customers likely to churn. Through detailed exploration and analysis, I will use statistical methods and visual tools to uncover insights about customer behavior. Churn Rate=(Number of customers who churned/Total number of customers)∗100 The above metric can present us with a good overview of the percentage of user churn. As a business, of course, the goal is to minimize churn.
Report: Final Report
Original dataset: Customer Churn Prediction
Skills: Excel, Analytical and Critical Thinking, Data Manipulation and Analysis , Data Visualization, Marketing Analytics, Trend Identification

Analyzing Retail Sales

Description: In this project, I acted as an analyst for a chain of retail stores to study sales performance over the past few years. I prepared and profiled the data using VLOOKUP() and other Excel functions, then used PivotTables to aggregate and reshape the data to answer key questions. I conducted variance and trend analyses to identify strong-performing years and categories, and utilized statistical methods to analyze average order values. Using What-If Analysis tools, I helped management plan for various scenarios. My findings revealed that Q4 is the most profitable period, prompting further investigation into specific categories and months driving these profits. The project concluded with a comprehensive report in Excel, providing actionable insights and recommendations for strategic planning.

Report: Final Report
Original dataset: Retail Sales
Skills: Data preparation and cleaning, Data analysis and profiling using PivotTables, Variance and trend analysis, Statistical analysis of KPIs, What-If scenario planning, Generating actionable insights, and Creating comprehensive Excel reports.

Certificates

Contacts