Short course of four meetings given in Portuguese at FGV EMAp, with the aim of introducing the area of Mechanistic Interpretability for Large Language Models (LLMs).
AI Safety
2024
Replication of “Towards Automated Circuit Discovery for Mechanistic Interpretability” paper, by Arthur Conmy et al., part of the process of upskilling in Alignment for AI Safety by Juan Belieni and Ana Carolina Erthal funded by Condor Initiative.
Condor Camp
·57 words
Condor Camp was an amazing event on AI safety that happened in Mexico City. There, I learned and discussed topics related to AI governance and technical AI safety. I was also introduced to the effective altruism philosophy. It was probably the best experience regarding career planning as well.