Miniaturizing Models for microNPUs: a Cascading Scheduler for TVM