A simple way to do this would be to:
- Make a flag that can be passed through to the specify the device (so you can use the CPU).
- Add a flag and alternative translation pipeline that just echos back the source segments.
- Use the NLLB tiny random model.
- Specify a configuration using these flags/this model in the launch.json.