Sunil Patel

Sunil Patel

Mumbai, Maharashtra, India
21K followers 500+ connections

About

A part of "TAC Team" of 24 people help clocking 1.5 Billion Dollars a year and…

Articles by Sunil

  • Building A Personal Deep Learning Machine

    Building A Personal Deep Learning Machine

    A worrier cannot fight longer on a rented sword. I have started working on deep learning in late 2014.

    10 Comments

Contributions

Activity

Join now to see all activity

Experience

  • NVIDIA Graphic

    NVIDIA

    Mumbai

  • -

    Mumbai, Maharashtra, India

  • -

    Mumbai Area, India

  • -

    Bengaluru Area, India

  • -

    Eschborn, Germany

  • -

    Hyderabad Area, India

  • -

  • -

    IIT-Delhi

Education

Licenses & Certifications

Publications

  • HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction

    5th ACM International Conference on AI in Finance [In Proceedings]

    Extraction and interpretation of intricate information from unstructured text data arising in financial applications, such as earnings call transcripts, present substantial challenges to large language models (LLMs) even using the current best practices to use Retrieval Augmented Generation (RAG) (referred to as VectorRAG techniques which utilize vector databases for information retrieval) due to challenges such as domain specific terminology and complex formats of the documents. We introduce a…

    Extraction and interpretation of intricate information from unstructured text data arising in financial applications, such as earnings call transcripts, present substantial challenges to large language models (LLMs) even using the current best practices to use Retrieval Augmented Generation (RAG) (referred to as VectorRAG techniques which utilize vector databases for information retrieval) due to challenges such as domain specific terminology and complex formats of the documents. We introduce a novel approach based on a combination, called HybridRAG, of the Knowledge Graphs (KGs) based RAG techniques (called GraphRAG) and VectorRAG techniques to enhance question-answer (Q&A) systems for information extraction from financial documents that is shown to be capable of generating accurate and contextually relevant answers. Using experiments on a set of financial earning call transcripts documents which come in the form of Q&A format, and hence provide a natural set of pairs of ground-truth Q&As, we show that HybridRAG which retrieves context from both vector database and KG outperforms both traditional VectorRAG and GraphRAG individually when evaluated at both the retrieval and generation stages in terms of retrieval accuracy and answer generation. The proposed technique has applications beyond the financial domain

    Other authors
    See publication
  • Fast, Self Supervised, Fully Convolutional Color Normalization of H&E Stained Images

    2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)

  • Sample Specific Generalized Cross Entropy for Robust Histology Image Classification

    2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)

  • BOOK - Getting started with Deep Learning for Natural Language Processing

    BPB Publications

    This book covers wide areas, including the fundamentals of Machine Learning, Understanding and optimizing Hyperparameters, Convolution Neural Networks (CNN), and Recurrent Neural Networks (RNN). This book not only covers the classical concept of text processing but also shares the recent advancements. This book will empower users in designing networks with the least computational and time complexity. This book not only covers basics of Natural Language Processing but also helps in deciphering…

    This book covers wide areas, including the fundamentals of Machine Learning, Understanding and optimizing Hyperparameters, Convolution Neural Networks (CNN), and Recurrent Neural Networks (RNN). This book not only covers the classical concept of text processing but also shares the recent advancements. This book will empower users in designing networks with the least computational and time complexity. This book not only covers basics of Natural Language Processing but also helps in deciphering the logic behind advanced concepts/architecture such as Batch Normalization, Position Embedding, DenseNet, Attention Mechanism, Highway Networks, Transformer models and Siamese Networks. This book also covers recent advancements such as ELMo-BiLM, SkipThought, and Bert. This book also covers practical implementation with step by step explanation of deep learning techniques in Topic Modelling, Text Generation, Named Entity Recognition, Text Summarization, and Language Translation. In addition to this, very advanced and open to research topics such as Generative Adversarial Network and Speech Processing are also covered.

    See publication
  • A study of traits that affect learnability in GANs

    2021 2nd International Conference on Computer Vision, Communications and Multimedia (CVCM 2021)

    Generative Adversarial Networks GANs are algorithmic architectures that use two neural networks, pitting one against the opposite so as to come up with new, synthetic instances of data that can pass for real data. Training a GAN is a challenging problem which requires us to apply advanced techniques like hyperparameter tuning, architecture engineering etc. Many different losses, regularization and normalization schemes, network architectures have been proposed to solve this challenging problem…

    Generative Adversarial Networks GANs are algorithmic architectures that use two neural networks, pitting one against the opposite so as to come up with new, synthetic instances of data that can pass for real data. Training a GAN is a challenging problem which requires us to apply advanced techniques like hyperparameter tuning, architecture engineering etc. Many different losses, regularization and normalization schemes, network architectures have been proposed to solve this challenging problem for different types of datasets. It becomes necessary to understand the experimental observations and deduce a simple theory for it. In this paper, we perform empirical experiments using parameterized synthetic datasets to probe what traits affect learnability.

    See publication
  • DeepLNC, a long non-coding RNA prediction tool using deep neural network

    Network Modeling and Analysis in Health Informatics and Bioinformatics

Patents

  • System and Method of Documenting Clinical Trials

    Issued US US20200105379A1

    This patent covers the following functionality:
    - Differentiating primary and secondary Publication related to Clinical Trial
    - Associating Clinical Trials to the Journal Publications based on the content of Clinical Trial and Publication
    Technology used: Siamese Networks for with Convolution and Recurrent components, Attention based text classifiers.

    Other inventors
    See patent
  • System and Method for Comparing Plurality of Documents

    Issued US US20200104359A1

    This patent covers the following functionalities :
    - Comparing documents semantically
    - Comparison tools that show addition modification and deletions.
    - Online learning capabilities - Semi-supervised learning
    - Works on sentence and paragraph level

    Technology used: Custom Sentence Vectorizer (Modified InferSent), Siamese Network

    Other inventors
    See patent
  • System and Method for Language-independent Contextual Embedding

    Issued US US20200311345

    This patent covers the following functionality:
    - Character-based language independent embeddings generation in an unsupervised manner
    - Multilingual alignment through custom loss function
    - Using Skip connection for easing gradient propagation
    - Snapshot ensemble technique to retrieve multiple models in a single run.
    Technology/logic used:PyTorch, Snapshot ensemble, Skip connection, Random multimodel CNN-LSTM ensemble

    See patent
  • System and method for creating database query from user search query

    Filed US US20210034621A1

    Disclosed is system for creating database query from user search query. The system comprises computing device for receiving user search query. The system further comprises processing arrangement communicably coupled to computing device. The processing arrangement comprises query component parser for identifying one or more attributes of user search query. The processing arrangement further comprises one or more component resolution modules. The one or more component resolution modules is…

    Disclosed is system for creating database query from user search query. The system comprises computing device for receiving user search query. The system further comprises processing arrangement communicably coupled to computing device. The processing arrangement comprises query component parser for identifying one or more attributes of user search query. The processing arrangement further comprises one or more component resolution modules. The one or more component resolution modules is operable to receive one or more attributes of user search query; convert user search query into sentence vector; trigger, based on one or more attributes, at least one module from a set of modules; provide sentence vector to triggered at least one module; and receive output from triggered at least one module to obtain database query. Disclosed further is method for creating database query from user search query using aforementioned system.

    Other inventors
    See patent

Projects

  • Developing a Custom Language Translation Engine for Life Science

    -

    Developing Language translation engine which understands the nuance of biomedical language.
    Tools/Technology : OpenAI Transformer, Nvidia-Docker, GPU

  • Primary/Secondary Clinical Trial Linking

    -

    Linking clinical trial as primary or secondary by comparing the content of clinical
    trial with millions of research papers.
    Tools/Technology: Skip-thought Sentence vectors, Siamese Networks, Pytorch, GPU
    Provisional patent in the United States: US16145828

    Other creators
  • Document Comparison

    -

    Detecting syntactic and semantic similarity between two documents also detect Insertion,
    deletion, and Modifications. Tools/Technology: Skip-thought Sentence vectors, Siamese
    Networks, Pytorch, GPU
    Provisional patent in the United States: US16143976

    Other creators
  • Developed Highly Scaleble Named Entity Resolution utilizing GPU Computing

    -

    Completed and delivered a general purpose framework for any kind of Named Entity Resolution (NER) problem. The solution uses state of art ensemble model of Convolutional Network and Long Short-Term Memory (LSTM), runs on GPU to deliver the best in comparison to conventional NER solutions. #nvidia-GPU #tensorflow #NER

    Other creators
  • Market Hawk

    -

    To constantly see what's trending in news, A generalized end to end solution utilizing Google news, Twitter handles, LinkedIn pulse and many more sources. Solutions that can assimilate gigs of data and present you with the nice dashboard of relevant plots. # GPU #Keras #django

    Other creators
  • Data Extraction Form Strips Of Medicines

    -

    Extracting information such as Mfg date, Exp date, batch number and active ingredient
    from medicinal strips using YOLO and other image processing techniques.
    Tools/ Technology used:- Tesseract, Pytorch ,Python

  • Financial Reconciliation using Deep Learning

    -

    AI-driven reconciliation for banking financial record linkage. Developed Deep Learning-based, Scalable architecture on Spark to process a high volume of banking data.
    Tools/Technology used: H2O. Python, Spark

    Other creators
  • Conversational Interface

    -

    For internal IT service enhancement and as a part of Ignio (TCS's IT Cognitive System for
    enterprise IT Ops) Completed a project on building conversational system using Natural
    Language Processing utilizing Word2Vec and DNN.
    Tools/ Technology used: - H2o, Gensim, Python

    Other creators
  • DeepInteract: Deep Neural Network Based Protein-Protein Interactions Prediction Tool

    -

    A Deep belief network performed Amazingly well in prediction 3D protein-protein interaction. This project was part of my master's thesis to outreach my research work. Deep belief networks to predict protein-protein interaction at scale.

    See project
  • Custom Elmo Embedding Training

    -

    Modifying Elmo code to custom tokenize according to biomedical tokens and training
    such model to achieve greater accuracy in downward tasks. Developing ELMO web API to integrate it with other architecture such as Pytorch
    Tools/Technology used : Elmo-bilm, Pytorch, Python, Nvidia-docker

    Other creators

Languages

  • English

    Full professional proficiency

Recommendations received

4 people have recommended Sunil

Join now to view

More activity by Sunil

View Sunil’s full profile

  • See who you know in common
  • Get introduced
  • Contact Sunil directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Sunil Patel in India

Add new skills with these courses