ISCA'23 - Lightning Talks - Session1A - Understanding and Mitigating Hardware Failures in Deep Learn