Rajesh N. Rao, PhD’s Post

View profile for Rajesh N. Rao, PhD, graphic

Staff Data Scientist at Walmart Global Tech India

Amazing talk on Physics of LLMs from ICML 2024. Many insights to experiment on. My key take aways - Decoder models > Encoder models. - LM > MLM - Rel, Rot Position Embedding > Absolute position embedding - Domain specific tags improve model training with corrupted dataset https://lnkd.in/g2hZDhwg

To view or add a comment, sign in

Explore topics