Hello! πŸ‘‹ Welcome to my Portfolio 😊

Data storyteller with a knack for turning numbers into narratives that inspire innovation and spark growth.

I'm Shubham Pandkar, a Data Scientist based in Pittsburgh, PA United States. πŸ‡ΊπŸ‡Έ

./img201.jpg
I'm a Data Scientist with a strong foundation in machine learning, NLP, and predictive analytics. My journey began with a degree in Electronics and Telecommunication Engineering, followed by hands-on experience at Infosys, where I built predictive models for fraud detection and streamlined claims processing. After earning my Master's in Data Science from New Jersey Institute of Technology, I joined Tango Analytics and then PNC Financial Services, where I applied advanced modeling techniques to drive customer insights and efficiency. Skilled in Python, SQL, Machine Learning, Natural Language Processing, Gen AI and cloud platforms, I'm passionate about solving real-world challenges and creating business impact through data.
About Me
I'm very flexible with time zone communications
./chess1.jpg
./grid.svg
Artificial Intelligence, Big Data, Algorithms, Neural Networks, Sentimentality, Dataography, Image Processing
Interests
./grid.svg
Communication, Critical Thinking, Collaboration, Adaptabilty, Business Acumen, Curiosity
Soft Skills.
./techstack.jpg
My Tech Stack
Apache SparkPythonSQLHadoopTeradataHive
Machine LearningDeep LearningNLPLLMRNNStatistics
NumPyPandasscikit-learnTensorFlowPyTorchTableau
Reach out to me with opportunities

A small selection of recent projects

bgimg
cover

Book Recommendation System

Explore the top 10 recommended books based on search criteria.

bgimg
cover

Quora Question Pair

Predicting if questions posted on quora has similar exxsting question pair.

bgimg
cover

Heart Disease Prediction

Identifying if a person will have a heart disease in 10 years based on health information.

bgimg
cover

Calibrating Credibility of COVID-19 Tweets

Tableau Dashboard with senttment analysis on how COVID tweets are distributed.

Degrees and Certifications

  • New Jersey Institute of TechnologyMaster of Science in Data Science - Computational track
  • Savitribai Phule Pune UniversityElectronics and Telecommunications Engineering
  • RedHatRedhat Certified System Administrator
  • John Hopkins UniversityData Scientist's Toolbox
  • IBMData Visualization and Dashboards

My Work Experience

My approach

Problem Understanding & Framing

I understand the business context, define the problem, set success metrics and formulate the hypothesis.

Data Exploration, Preparation, & Modeling

Once the hypothesis is formulated, I collect the data, clean and pre-process it, extract features which is followed by model selection with training and evaluation.

Communication, Deployment and Continuous Improvement

Present the model results and insights to the stakeholders. Once everything is approved, deployment of the model and maintenance is carried out. This is followed by research to improve the results.