Great post Abhinav. Learnt a lot. I wonder if you also cover or point me in the direction to understand how a tensor operation would become different at the library level from application point of view . For eg what changes (if any) needs to be done in either pytorch or the likes to better conquer this massive parallelisms offered by TSUs or is this completely taken by the Groq’s compiler behind the scenes . I understand Groqs hasn’t published anything yet but if you came across any nuggets on your research pls do share!
Great post, thank you very much!
Great post Abhinav. Learnt a lot. I wonder if you also cover or point me in the direction to understand how a tensor operation would become different at the library level from application point of view . For eg what changes (if any) needs to be done in either pytorch or the likes to better conquer this massive parallelisms offered by TSUs or is this completely taken by the Groq’s compiler behind the scenes . I understand Groqs hasn’t published anything yet but if you came across any nuggets on your research pls do share!
I'm so impressed with Groq, great to know more about the technical details.
Fantastic post Abhinav! Tremendous research and really topical too.
Thanks for the post, Abhinav! This was really insightful and I learned a ton.