suppose one has an idea for a different architecture / functional form etc, assu... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		DoctorOetker on Aug 30, 2022 \| parent \| context \| favorite \| on: Run Stable Diffusion on Intel CPUs suppose one has an idea for a different architecture / functional form etc, assuming the receiving model is substantially smaller so that the dominant computational cost is in the SD model, how long would effective knowledge distillation take on say a CPU?

astrange on Aug 31, 2022 [–]

That’s called teacher-student learning. It could still take weeks on a single machine easily, but renting more GPU time or getting free credits from somewhere is perfectly plausible.

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact