1. Guided a team of 6-8 data scientists at Happiest Minds Technologies on the following AI / Cognitive Computing Projects:
    1. Smart Kiosk – Smart Kiosk is developed by data science and analytics team at Happiest Minds for retail customers. In addition to the barcode scanner, smart kiosk has computer vision technology to recognize fresh produce (like fruits and vegetables), and optical character recognition (OCR) to read product labels, brand-names and logos. Once the product is recognized, smart kiosk shows relevant information like nutritional facts and recipes for food products, and related items for cross-selling.
    2. Video Highlights Generation – The algorithm uses audio signal processing to detect cheering, clapping and applause, combined with speech recognition on audio commentary to automatically generate highlights for sports videos.
    3. Damage Detection – This deep-learning based image classifier uses TensorFlow model to identify signs of visible damage like scratches or cracks on phone screens, car windshields and glass windows to speed up the process of insurance claims processing.
    4. Audio Classification – Detects laughing, crying, screaming / shouting, clapping, cheering, singing from audio signals.
    5. Health Bot – A conversational chat interface to search nearby doctors and hospitals by specialty areas like cardiologists, dentists, pediatricians etc.
    6. Video Analytics – Face Recognition, Person Tracking, Vehicle Number Plate Recognition for surveillance cameras.

  2. Built a media player and car infotainment system on Raspberry Pi.

  3. Created a culinary search-engine to search thousands of online recipes based on the given ingredients (e.g. rice, lentils, potatoes, spinach, corn, mushroom) or category (e.g. sandwich, soup, cake, breakfast, brunch, vegetarian). Sample search-queries include: “apple pie”, “carrot cake”, “mushroom soup”, “potato salad” etc.

  4. Projects done in MakeMyTrip
    1. Built a destination-search engine by mining articles on Wiki-Travel, to suggest top domestic and international cities for a selected activity (e.g. scuba diving, ice skating, cross-country skiing, trekking, horse riding etc) or theme (e.g. wild life safari, hill station, beach resort, amusement parks, world heritage sites).
    2. Built a hotel-search engine by mining reviews from Trip Advisor. Hotels can be searched near specific points-of-interests (POIs) like metro station, airport, popular landmarks, local attractions and neighborhoods, or by specialty services like Italian Dining, Infinity Pool, Private Beach etc.
    3. Performed sentiment analysis on text-snippets (phrases) extracted from online hotel reviews to score and rank hotels on various dimensions like location, quality of food, service, cleanliness and amenities.
    4. Trained a word2vec model to automatically discover concepts from text by identifying semantically similar or related terms. Created visualizations by projecting word vectors in 3D space using Embedding Projector visualization tool from Google.

  5. Projects done in Persistent Systems Ltd
    1. Automatically generated skill-profiles for technical support engineers by mining unstructured text-content from email communications. The algorithm retrieves a ranked list of experts in the given skill-areas.
    2. Built a demo prototype for a client that works in medical and health-care domain by analyzing the content posted on social media. The project identifies trending topics, user activities, as well as, the sentiments in user posts, to study the effects on product sales and revenue.
    3. Performed predictive analysis for a client that offers bike rental service in the Bay Area, by computing correlations between bikes rented and weather parameters like temperature, wind speed, humidity etc. on a given day.
    4. Developed forecasting models for predicting stock prices for Fortune 500 companies based on the historical data for the past 10 years using Time-Series Analysis.
    5. Implemented brand-clustering algorithm to identify companies that work in the same industry sectors or offer similar products and services, by extracting features from online news text using word2vec utility.
    6. Mentored a team of college interns from College of Engineering, Pune (COEP) on “Box-Office Predictions” project, to estimate the box-office revenue for movies, based on parameters like reviews & ratings, production budget, studio, cast and genres.

  6. Proposed a novel technique for detecting and analyzing humor in comedy television show FRIENDS. This research was published in one of the top international conferences in Natural Language Processing area (EMNLP) held in Sydney, Australia in 2006. The paper analyzed dialog transcripts and audio recordings in FRIENDS TV-show for automatic humor detection.

  7. Implemented Text Mining and Information Extraction algorithms for automatically building a large-scale relational database for movie actors and pop-singers by mining web pages and biographies on Wikipedia. This project was carried out at SONY Corporation, Japan during summer 2008 internship program.

  8. Explored data mining techniques to automatically categorize product items based on the similarity of their features and product descriptions. This project was done at, Seattle during summer 2005 internship program. The project utilized unsupervised clustering algorithms to automatically build a product taxonomy by grouping similar products.

  9. Developed classification algorithms for predicting the email reply order for Automatic Email Prioritization. The project analyzed user behavior and inter-personal relationships among users, along with the features extracted from emails to predict the email-reply order for prioritization.

  10. Proposed a Machine Learning framework for analyzing coherence in spoken conversations. The algorithm tries to distinguish random incoherent conversations from natural coherent dialogs with over 85% accuracy. This research was published in FLAIRS 2008 conference held in Florida.

  11. Implemented a language semantics demo for analyzing similarities and relations between entities from natural language texts. This project was done at the Information Sciences Institute (ISI) of University of Southern California (USC) during summer 2007 internship.

  12. Developed an open source software package SenseClusters for Unsupervised Word Sense Discrimination task. This project was funded by the National Science Foundation (NSF) research grant and was completed as part of the Master’s Thesis at University of Minnesota.

  13. Worked with collaborators at Mayo Clinic in Rochester, Minnesota on bioinformatics project to develop unsupervised learning methods for resolving ambiguities in biomedical texts.

  14. Won 2nd Prize for Best Paper in Artificial Intelligence & Fuzzy Logic in technical/research paper presentation competition organized by Pune Institute of Computer Technology (PICT), India in association with IEEE. The paper presented a prototype model for a language understanding system using parsing and logical inference.

  15. Worked on a Contextual Advertising project for a start-up company, Social Extract. The project analyzes text content on Twitter to identify twitter users and relevant tweets for advertising and marketing campaigns.

— Last Updated in Sep 2017