Skip to content
View ram-senth's full-sized avatar

Organizations

@getsentry
Block or Report

Block or report ram-senth

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ram-senth/README.md

Ram is very passionate about solving real world problems with machines and data. He has 15+ years of software engineering experience in distributed systems, backend engineering, data engineering, ML engineering and big data. Last 5 years he was focused on large scale feature engineering, experimentation, machine learning, ML infrastructure and backend services.

This portfolio page has a compilation of projects done by him while doing the Masters in Information and Data Science (MIDS) degree from University of California, Berkeley.

Resume
LinkedIn Profile

Project Portfolio

AI Based Conversational Language Tutor

Chatbot for interactive language learning using natural conversations, implemented using Generative AI, LLM fine tuning, Retrieval Augmented Generation (RAG), NLP techniques, AWS SageMaker, Lambda, AWS amplify, PyTorch, WandB, HuggingFace and LangChain.

Presentation Deck

High Level Approach System combined more deterministic discriminative models with more random generative models. with this approach, the system is more predictable when explaining errors and, at the same time, adds meaningful variety in chat response.
  1. SentenceTransformer and semantic similarity search based classification model to keep conversation on topic
  2. Fine tuned BERT as Grammatical Error Correction model to classify each word into one of multiple error classes
  3. Mistral7B based LLM model for generating chat response
Key Challenges
  • Random nature of LLMs based text generation interferes with language learning. Addressed using a combination of both discriminative and generative models.
  • Class imbalance is a tricky issue with NLP. Addressed using appropriate loss function and evaluation metrics.
  • Lack of readily available dataset for evaluating if chat is on topic. Leveraged ChatGPT to create a custom dataset.
Highlights
  • One of six projects (out of 41) selected for project showcase

  • Grammatical Error Correction model achieved 0.90 macro averaged F1 score, an improvement by 0.28 over the baseline model.

StoryBot: Interactive Story Generation For Kids

NLP sentence continuation model to enable an interactive story development experience for children. This project involved fine tuning three transformer based models - one decoder only and two encoder/decoder - on sentence pairs from children story books.
Presentation Deck
Final Report

Data Visualization:

A visual exploration of global trends in children vaccination with goal of understanding the challenges in achieving higher vaccination rates and identifying key insights that improve effectiveness of vaccination drives.

Website

Pinned Loading

  1. team-langbot/conversationally team-langbot/conversationally Public

    AI based conversational language tutor

  2. datasci-w266-2023-spring-team-story-bot datasci-w266-2023-spring-team-story-bot Public

    StoryBot - interactive story generation for kids

    Jupyter Notebook

  3. w209_vax_viz w209_vax_viz Public

    Visualization of global vaccination data from UNICEF

    JavaScript

  4. W281/w281-fall2022-section2-team3 W281/w281-fall2022-section2-team3 Public

    Section 2 Team 3's repo for w281 (fall 2022) final project

    Jupyter Notebook 2