Built data simulator and continuous compliance monitoring using NLP/ML
I built a data simulator utilizing machine learning to continuously monitor compliance by annotating and classifying financial text data. The work included the use of NLP/ML techniques to create labeled datasets for automated systems supporting regulatory compliance. This process involved defining label categories and standardizing data definitions for more consistent analysis and reporting. • Developed labeled datasets of banking and compliance documentation • Applied NLP algorithms to annotate text for risk and regulatory categories • Employed classification and entity labeling in finance text • Used Python and proprietary tools for data processing and labeling