Codecov Report
Attention: Patch coverage is

Additional details and impacted files

@@              Coverage Diff                @@
##              master       #3396     +/-   ##
===================================================
- Coverage   32.19210%   32.16198%   -0.03012%
===================================================
  Files            147         147
  Lines          40687       40722         +35
===================================================
- Hits           13098       13097          -1
- Misses         26816       26852         +36
  Partials         773         773
===================================================
... and 1 file with indirect coverage changes
leszko left a comment:
I think it looks good. Two things I'd check before merging and deploying to prod:

- How much does CPU/MEM usage of the mediamtx pod increase with each viewer? We have the following mediamtx Kubernetes resources set; maybe we'll need to increase them:

    resources:
      limits:
        memory: 4Gi
      requests:
        memory: 4Gi

- If one ffmpeg RTMP command fails, does the other still work? E.g. if you can't push to Studio, will the push to MediaMTX still work? I think it should, but I'd double check, because multiwriter and all these io.Writers in Golang behave weirdly sometimes.

Thanks Max 🙏
server/ai_live_video.go (outdated):

      err = errors.New("unknown error")
    }
    clog.Errorf(ctx, "LPMS panic err=%v", err)
    params.liveParams.stopPipeline(fmt.Errorf("LPMS panic %w", err))
Could we add a graph to our AI dashboard in Grafana to track this?
I'll raise a ticket for that 👍
server/ai_live_video.go (outdated):

    }

    cmd := exec.Command("ffmpeg",
        "-i", "pipe:0",
If the thread that pushes segments into pipe:0 crashes, does this ffmpeg process die? What does recovery look like then?
This was from the previous logic, and yep, that's right: if the thread pushing segments stops, stopPipeline would have been called here, so everything shuts down; then the input stream can reconnect and try again.
server/ai_live_video.go (outdated):

    // return
    }
    clog.Infof(ctx, "Process output: %s", output)
    time.Sleep(5 * time.Second)
What is this 5s sleep for?
@leszko Very good point about the multiwriter; it does indeed fail if there are issues with either ffmpeg output. I've changed it now to a custom implementation which only returns an error if all writes fail. I'll check out the CPU usage in staging 👍
This breaks MediaMTX playback during local testing with the default setup unless a separate RTMP output URL is specified, because two output streams with the same name are pushed into MediaMTX ... @mjh1 can you fix this?
Also, I believe we should be careful with the multiwriter here, because one blocked write can still stall the others; ideally the writes would be asynchronous (which also implies doing some buffer management). Not sure if this PR is meant to be a temp fix during the latency investigations, though; in that case we can probably live with this blocking risk as long as things are reverted later on.
We want to try direct playback from MediaMTX to see how much startup time is saved by avoiding the Studio stream startup.