Case Study: Leverage NLP techniques for risk classification of legal

Key Challenges:

Approach:

Pre-processing engine for whitespace removal, punctuation removal, stop-word removal, etc.
Term-document matrix creation
Use text classification and NLP algorithms to build the foundational ontology through feature extraction
Use Expectation-Maximization algorithms to transfer the classification knowledge across languages by translating the model features
Use the extracted feature set, in conjunction with business rules, to flag contracts into three risk categories: high, medium, and low
Validate results against a test set, and incorporate a feedback loop to continuously improve model accuracy
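
The pre-processing, term-document matrix, and rule-based flagging steps above can be illustrated with a minimal sketch. This is not the engine used in the case study: the stop-word list, the risk terms and their weights, the score thresholds, and the function names (preprocess, term_document_matrix, risk_category) are all hypothetical, chosen only to show how the pieces might fit together. The classification model and the Expectation-Maximization cross-language step are not covered here.

```python
import string
from collections import Counter

# Hypothetical stop-word list (assumption); a real engine would use a fuller one.
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "in", "for", "is", "shall", "be"}

# Hypothetical business rules (assumption): risk terms and illustrative weights.
RISK_WEIGHTS = {"indemnify": 3, "unlimited": 3, "liability": 2, "penalty": 2, "terminate": 1}


def preprocess(text):
    """Lower-case, strip punctuation, collapse whitespace, drop stop words."""
    text = text.lower().translate(str.maketrans("", "", string.punctuation))
    tokens = text.split()  # split() with no argument also collapses whitespace runs
    return [t for t in tokens if t not in STOP_WORDS]


def term_document_matrix(docs):
    """Return (vocab, matrix), where matrix[i][j] counts vocab[i] in docs[j]."""
    counts = [Counter(preprocess(d)) for d in docs]
    vocab = sorted(set().union(*counts))
    matrix = [[c[term] for c in counts] for term in vocab]
    return vocab, matrix


def risk_category(doc, high=5, medium=2):
    """Score a contract with the weighted risk terms and bucket it.

    The thresholds `high` and `medium` are illustrative assumptions.
    """
    counts = Counter(preprocess(doc))
    score = sum(RISK_WEIGHTS.get(term, 0) * n for term, n in counts.items())
    if score >= high:
        return "high"
    if score >= medium:
        return "medium"
    return "low"


if __name__ == "__main__":
    contracts = [
        "The supplier shall indemnify the buyer for unlimited liability.",
        "Either party may terminate with a penalty.",
        "Payment is due in 30 days.",
    ]
    vocab, matrix = term_document_matrix(contracts)
    print(len(vocab), "terms x", len(contracts), "documents")
    for contract in contracts:
        print(risk_category(contract), "-", contract)
```

In this sketch the term counts feed the rules directly; in a fuller pipeline the term-document matrix would instead be the input to a trained classifier, with the business rules applied on top of its output.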

Benefits: