We don't really have time to go actually make stardis work, but the stardis tests are failing and we expect them to at this point, and it would be better for tardis development to change this behavior so we can more easily see if a given pr actually breaks tardis (the tardis tests or benchmarks or docs build).