Computational Biologist — Rossi
Trained large language models (LLMs) from scratch across RNN, LSTM, and Transformer architectures using full-cycle data curation and iterative training. Designed and applied reinforcement learning from human feedback (RLHF) and reward modeling to align model behaviors and outputs. Built bespoke domain-specific evaluation benchmarks with prompt engineering and ground-truth scoring for advanced model assessment. • Data lifecycle management encompassing collection, extraction, cleaning, and classification • Implemented structured reasoning and Chain-of-Thought (CoT) analysis with feedback-driven corrections • Developed adversarial evaluation for model robustness and safety • Constructed hand-crafted benchmarks and scoring pipelines using Python and PyTorch