Harsh Sharma

NLP | Deep Learning Enthusiast | Data Science

SRM Institute of Science and Technology

Biography

I am currently working as a BA3 Grad Analyst at Barclays in Pune, India. I graduated with a B.Tech in CSE from SRM Institute of Science and Technology, Chennai. I am a data science enthusiast and a passionate deep learning developer and researcher who loves working on projects in the NLP and data science domains. I am always up for hackathons and have been part of student-led tech clubs, holding executive board positions.

I am also quite active in research, especially in NLP and deep learning. Research papers I co-authored were selected for IEEE-INCET 2021 and for Springer's Education and Information Technologies (impact factor 3.6). Currently, I mentor juniors in hackathons, host and organize webinar sessions on deep learning, write blogs on deep learning-based natural language processing, and collaborate with fellow deep learning developers on state-of-the-art projects and research.

Recently I interned as a MITACS Globalink Research Intern 2022 at the University of British Columbia - Okanagan, Canada. I worked under Dr. Fatemeh H. Fard on a software engineering based NLP research project, Neural Source Code Intelligence, where I developed language models by pre-training RoBERTa on Java bytecode and fine-tuning it on downstream tasks such as code summarization and code search.

Download my resumé.

Interests
  • Natural Language Processing
  • Artificial Intelligence
  • Deep Learning
  • Time Series Analysis
  • Data Science
Education
  • B.Tech in Computer Science Engineering, 2019

    SRM Institute of Science and Technology, Chennai

  • CBSE - Higher Secondary Education, 2017

    Mahaveer Public School, Jaipur

  • CBSE - Secondary Education, 2015

    C.M. Academy, Ankleshwar

Skills

Deep Learning

Natural Language Processing (NLP) | Time-Series Analysis | Sequence Learning | Sequence to Sequence Networks | Classical ML

Frameworks

TensorFlow - Keras | PyTorch | BERT | Simple Transformers (Hugging Face) | Sentence Transformers (SBERT) | NLTK | scikit-learn

Databases

MongoDB (PyMongo) | Firebase (Pyrebase) | MySQL (SQLite3)

Web Frameworks

Flask | Streamlit | FastAPI | Postman | HTML | CSS

Programming Languages

Python | C/C++ | R

Cloud Platforms

AWS Basics (EC2-Instance) | Heroku | Azure Basics

Experience
 
BA3 Grad Analyst
Jul 2023 – Present Pune, India
  • Working in the core ETL team and the DevOps team.
  • Working with Innovationshub on generative AI based projects to reduce production and testing effort.
 
Junior Data Scientist
Dec 2022 – Jul 2023 London (Remote from India)
  • NLP: using language models to integrate with and enhance existing models for skill matching and CV parsing.
  • Building recommendation systems using graph neural networks and language models.
 
MITACS Globalink Research Intern - SDE - NLP
Jul 2022 – Aug 2022 British Columbia, Canada
  • Selected for the prestigious MITACS Globalink Research Internship in 2022.
  • Worked as an SDE-NLP intern under Dr. Fatemeh H. Fard on an NLP project on Neural Source Code Intelligence.
  • Pre-trained a RoBERTa model on Java bytecode paired with its respective documentation, using Microsoft’s CodeSearchNet dataset.
  • Fine-tuned it on downstream tasks such as code summarization and code search.
 
Deep Learning Intern - NLP
Jul 2021 – Present London HQ, Bengaluru Team
  • Worked on BERT-based NER performance tests on Indic languages.
  • Fine-tuned neural machine translation on Indic languages using MarianMT and PyTorch.
  • Currently exploring topic modelling using Sentence Transformers and BERT.
 
NLP Research Intern
National Institute of Technology - MNIT
May 2021 – Present Jaipur, Rajasthan
  • Worked on developing a multimodal deep learning neural network for a research paper.
  • Multimodal deep learning: NLP combined with image feature extraction for detecting fake or suspicious news.
 
MLH Pre-Fellowship
Jul 2021 – Present New York
  • Pre-Fellow at the prestigious Major League Hacking, Summer '21 batch.
  • Worked on the open-source Python package TF-Watcher, a developer tool used to track the training of ML models in real time.
  • Won the title of “Best Project of the Pod” during the fellowship.
 
Microsoft Learn Student Ambassador - Beta Level
Jul 2021 – Present New York
  • Beta Level member of the prestigious Microsoft Learn Student Ambassadors program.
  • Selected as an Alpha Level member.
  • Conducted a virtual workshop on data augmentation in NLP.
 
Machine Learning Engineer
Apr 2021 – Present Chennai
  • Part of a start-up product incubated by SRM.
  • Worked on deep learning algorithms to derive insights about fake reviews.
  • Developed models for fake review detection and sentiment analysis, along with customized metrics.
  • Deployed and hosted via FastAPI on an AWS EC2 instance.
  • Working on custom named entity recognition with an automated resume parser using BERT.
  • Heiphen is a revolutionary employee review platform created to help employers get genuine employee reviews. We use machine learning and comprehensive database-building for thousands of candidate profiles to rate workers according to their proficiency through authentic reviews by their past employers.
 
Associate Technical Director and Deep Learning Researcher
Jul 2020 – Apr 2021 Chennai
  • Started as a Deep Learning Developer, where I worked on several state-of-the-art projects.
  • Later worked as Associate Technical Director, where I directly managed and organised community events, mentored juniors, and hosted workshops on data science and deep learning.
  • My contributions were:
  1. Conjexure - Stock Market Forecasting Project (Conjexure is a machine learning web app for forecasting the future stock prices of certain companies, built using deep learning and deployed on Streamlit.)
  2. Super-color Project (An ML-based web application to colorize black and white images, with two models trained on TensorFlow and deployed on Streamlit; recognized by the Streamlit developers and included in the worldwide weekly roundup of their Community Forum.)
  3. Random Forest: The Optimal Choice For Developers Blog (blog published in the DataX Journal on Medium)
  4. Developing Chatbots with RASA - From Intuition to Implementation and Deployment Blog (blog published in the DataX Journal on Medium)
  5. Understanding Encoders and Decoders Blog (blog published in the DataX Journal on Medium)
  6. Resourceify Contributor (Data Science, Machine Learning, and Artificial Intelligence resources repository on Data Science Community SRM GitHub)
  7. Version Control with Git and GitHub Webinar (Hosted an internal webinar session with 100+ attendees)
  8. NeuRes Series (Organized a 6-day webinar series from 16th to 25th of March on the basics of Deep Learning with 500+ attendees. Hosted a session on Deep Learning Practices in Natural Language Processing [NLP].)
  9. Data Science Community Outer Circle Discord (Managed a virtual community of 500+ members via Discord, interacting with deep learning enthusiasts.)
 
Community Executive
SRM Machine Intelligence Community
Jul 2020 – Present Chennai
Deep Learning Developer and Researcher at SRM Machine Intelligence Community, an undergraduate research community focused on the latest developments in machine intelligence, producing impactful projects, and gaining meaningful insights.

Accomplishments & Certifications

An effective deep Learning Pipeline for generation, classification and recommendation in Bloom’s Taxonomy Domains.
Smart and Context-Aware System employing Emotions Recognition, IEEE-INCET 2021. Co-authored a research paper that was selected for the IEEE-INCET 2021 research conference.
See certificate
Coursera
Natural Language Processing in TensorFlow
See certificate
National-level 12th position in the Hack the CW Hackathon, among the top 12 finalists
See certificate
Secured 13th position out of 400+ teams in the Octhacks 3.0 Hackathon, among the top 20 finalists
See certificate
Secured 8th position out of 150+ teams in the SLAC 2.0 Hackathon, among the top 15 finalists
Coursera
Neural Networks and Deep Learning
See certificate
Python For Data Science And Machine Learning BootCamp
See certificate

Recent Posts

Projects

Recent Projects I worked on

Recent Publications

(2021). Smart and Context-Aware System employing Emotions Recognition. In IEEE.

Contact