        in
       /  \
    W(x)  V(x)
     |      |
     |    ReLU²
      \    /
        ⊗
        |
       out
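A minimal sketch of the gated unit in the diagram above, using NumPy: the input is projected through two parallel matrices, the gate branch is squashed by a squared ReLU, and the branches are combined elementwise. The names `gated_relu2_unit`, `W`, and `V`, and the dimensions, are illustrative assumptions, not a specific published implementation.

```python
import numpy as np

def gated_relu2_unit(x, W, V):
    # out = W(x) ⊗ ReLU²(V(x)): the W branch passes through
    # unchanged, the V branch is gated by a squared ReLU,
    # and the two are multiplied elementwise.
    gate = np.maximum(x @ V, 0.0) ** 2
    return (x @ W) * gate

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))    # batch of 4, model dim 8 (illustrative)
W = rng.standard_normal((8, 16))   # value projection
V = rng.standard_normal((8, 16))   # gate projection
y = gated_relu2_unit(x, W, V)      # shape (4, 16)
```

Wherever the gate branch is non-positive, the squared ReLU zeroes the output entirely, which is what makes the ⊗ node act as a gate.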


researcher at zyphra
currently working on novel hardware-aware pretraining architectures for low-latency inference at scale, diffusion language models, sample-efficient context extension, and spectral-clipping optimizers
-Rishi

Feel free to email me, I am almost always interested in meeting new people.

recent work

Training Foundation Models on a Full-Stack AMD Platform


Compressed Convolutional Attention
'_'