Discussion about this post

User's avatar
Dr Teodora Szasz's avatar

Thanks for this, Neural Foundry — really appreciate you taking the time to leave such a detailed read.

Totally with you on IoU vs “95% accuracy”: segmentation is basically a class-imbalance booby trap, and “overall accuracy” can look amazing while the model quietly faceplants on the tiny-but-life-or-death classes (pedestrians/cyclists). Per-class IoU (and recall on those rare classes) tells the truth.

Also love that you called out the cGAN / synthetic weather angle — the long tail is where models go to get humbled, and generating credible edge conditions is one of the few practical ways to stress-test before reality does it for you.

And YES to your sensor-fusion nuance: it’s not just redundancy, it’s complementarity across physics + clocks. Cameras give dense semantics but come with latency; radar’s Doppler velocity is insanely valuable in real time. “Good fusion” is often more about time alignment + uncertainty modeling + ego-motion compensation than it is about just stacking features.

If you’re open to sharing: in your experience, what fusion approach held up best in practice — classic tracking + late fusion, or learned BEV-style fusion? Might be a perfect follow-up mini-post. 🚗⚡️

No posts

Ready for more?