add pypy-3.7 to test matrix #663

bollwyvl · 2021-06-30T15:36:46Z

Investigating downstream errors:

test vs pypy3 in CI ipython/ipykernel#700

Seeing if this test fail is spurious/azure/cosmic rays:

=================================== FAILURES ===================================
________________________________ test_shutdown _________________________________

    def test_shutdown():
        """Kernel exits after polite shutdown_request"""
        with new_kernel() as kc:
            km = kc.parent
            execute('a = 1', kc=kc)
            wait_for_idle(kc)
            kc.shutdown()
            for i in range(300): # 30s timeout
                if km.is_alive():
                    time.sleep(.1)
                else:
                    break
>           assert not km.is_alive()
E           assert not True
E            +  where True = <bound method KernelManager.is_alive of <jupyter_client.manager.KernelManager object at 0x000055773df9b0c0>>()
E            +    where <bound method KernelManager.is_alive of <jupyter_client.manager.KernelManager object at 0x000055773df9b0c0>> = <jupyter_client.manager.KernelManager object at 0x000055773df9b0c0>.is_alive

../_test_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placeh/site-packages/ipykernel/tests/test_kernel.py:373: AssertionError

from conda-forge/ipykernel-feedstock#84 (comment)

the test as-written is sensitive to garbage collection, which is different on pypy instead, test only that we create a valid tracker

wait for the expected output to appear, instead of assuming 1 second is enough this should be both faster (where sleep(1) was enough) and more reliable

minrk · 2021-08-09T09:18:00Z

There does appear to be a real failure and real difference in behavior with respect to signalling child processes of the kernel.

We use killpg to terminate the process group of the kernel. This is what's being tested in the failing test_signal_kernel_subprocesses. On pypy, we are getting the behavior of only signalling the kernel itself, whereas on CPython the child processes also get the signal (as intended).

I managed to reproduce it with this test:

test_signal_subprocesses.py

import os
import signal
import sys
import time
from subprocess import Popen

def child():
    print(f"child waiting: {os.getpid()}")
    try:
        time.sleep(10)
    except KeyboardInterrupt:
        print(f"child interrupted: {os.getpid()}")
        sys.exit(-2)

def parent():
    children = []
    for i in range(2):
        p = Popen(['bash', '-i', '-c', 'sleep 10'])
        children.append(p)

    print(f"parent waiting: {os.getpid()}")
    try:
        time.sleep(10)
    except KeyboardInterrupt:
        print(f"parent interrupted: {os.getpid()}")

    for child in children:
        child.wait()
        print(f"child {child.pid} status: {child.poll()}")


def test():
    p = Popen([sys.executable, __file__, 'parent'], start_new_session=True)
    pgid = os.getpgid(p.pid)
    print(f"parent pid: {p.pid}, pgid: {pgid}")
    time.sleep(2)
    print("signalling parent")
    os.killpg(pgid, signal.SIGINT)
    p.wait()


def main(which):
    if which == 'test':
        test()
    elif which == 'parent':
        parent()
    elif which == 'child':
        child()


if __name__ == "__main__":
    if len(sys.argv) > 1:
        main(sys.argv[1])
    else:
        main("test")

where CPython interrupts the bash children, but PyPy does not. I suspect this has to do with signal handler inheritance, and CPython is clearing signal handlers while PyPy doesn't? That's a guess. Switching the subprocess to use Python instead of bash results in interrupted children, though, so I assume the Python interpreter reinitializes SIGINT handler where bash does not.

@mattip I think this should be considered a bug in PyPy's subprocess module.

Switching the subprocesses to use Python instead of bash causes the test to pass.

PyPy doesn't create subprocesses in the same way as CPython, resulting in ignored signals when we try to interrupt with `killpg`

mattip · 2021-08-09T09:27:12Z

I think this should be considered a bug in PyPy's subprocess module.

Thanks for tracking this down. I will try to figure out what is going on.

wait for processes to exit, not for expected result otherwise, a timeout is forced for any incorrect result

minrk · 2021-08-09T09:37:28Z

The remaining failures (for me) are FD exhaustion, which makes me suspect that we are somewhere relying on garbage collection to clean up some zmq sockets or other. This is usually reliable on CPython, which is how we may have gotten into the situation without noticing, whereas PyPy often needs explicit gc calls. The 'right' fix is to add the necessary explicit closes on whatever resources it is that aren't being cleaned up.

mattip · 2021-08-09T09:59:53Z

I am not sure I understand the difference in behaviour. On Ubuntu 20.04, both cpython and pypy3.7-v7.3.5 print (at the end) for test_signal_subprocesses.py

signalling parent
parent interrupted: 245302
child 245303 status: -2
child 245304 status: -2

As for closing resources: if you could add a fixture to always call for i in range(3): gc.collect() at test teardown, that should trigger a ResourceWarning when the gc gets a file or socket that still holds a resource.

minrk · 2021-08-09T10:50:59Z

Ah, maybe it's a macOS thing, then. For me, the children run to completion with pypy and exit with status 0 (identical behavior to not signalling them at all). CPython behaves as expected, though.

Good call on the gc. I'm poking around to see where I can find leftover references. I think a fixture to assert that there are no open zmq resources across tests is the right thing to do, which ought to catch this kind of issue. There could also be other issues, like subprocess pipes which might not be closed.

bollwyvl · 2021-08-09T11:03:52Z

You good folk are wizards, thank you for pushing this forward!

so hangs don't run for hours pypy needs more than 10 minutes, cpython doesn't

Julian · 2021-08-09T13:15:10Z

There does appear to be a real failure and real difference in behavior with respect to signalling child processes of the kernel.

We use killpg to terminate the process group of the kernel. This is what's being tested in the failing test_signal_kernel_subprocesses. On pypy, we are getting the behavior of only signalling the kernel itself, whereas on CPython the child processes also get the signal (as intended).

I managed to reproduce it with this test:
test_signal_subprocesses.py

where CPython interrupts the bash children, but PyPy does not. I suspect this has to do with signal handler inheritance, and CPython is clearing signal handlers while PyPy doesn't? That's a guess. Switching the subprocess to use Python instead of bash results in interrupted children, though, so I assume the Python interpreter reinitializes SIGINT handler where bash does not.

@mattip I think this should be considered a bug in PyPy's subprocess module.

Switching the subprocesses to use Python instead of bash causes the test to pass.

(I'm coming in late here and possibly haven't read the thread carefully enough) -- but what difference should I be observing here? On Big Sur 11.4 (MBA M1 2021) I see no difference between running this on cpython3.8 and pypy3.7 (7.3.4)

bollwyvl · 2021-08-09T13:33:06Z

@Julian we initially discovered some of these challenges when running the full test suite of the downstream ipykernel on conda-forge:

conda-forge/ipykernel-feedstock#84 (comment)

Unfortunately, I know little about the inner guts of pyzmq, and less about the inner guts of pypy... but do try to keep these technologies working as broadly as possible... luckily far wiser folk have appeared!

blink1073 · 2022-03-28T19:44:48Z

I added pypy-3.8 in #757

mattip · 2022-03-29T04:26:29Z

I added pypy-3.8 in #757

Cool. It is passing (without coverage) in a similar time to the CPython runs. It seems this issue can be closed? Please do ping me if tests break.

blink1073 · 2022-03-29T09:56:54Z

It seems this issue can be closed? Please do ping me if tests break.

Sounds good, thanks @mattip

bollwyvl changed the title ~~add pypy3 to test matrix~~ add pypy-3.7 to test matrix Jun 30, 2021

This was referenced Jun 30, 2021

expand test matrix to windows, macos and pypy-37 erdewit/nest_asyncio#55

Merged

Restore pypy builds for 6.x conda-forge/ipykernel-feedstock#85

Closed

blink1073 added the maintenance label Aug 5, 2021

mattip mentioned this pull request Aug 6, 2021

Help needed understanding a test failure in jupyter_client + pypy zeromq/pyzmq#1581

Open

bollwyvl and others added 5 commits August 9, 2021 10:07

add pypy3 to test matrix

9a2f7d3

linting

b08c968

no mypy on pypy (part 1)

efb19df

make mypy cpython-only

399313f

avoid testing underlying pyzmq machinery in test_tracking

a212f90

the test as-written is sensitive to garbage collection, which is different on pypy instead, test only that we create a valid tracker

minrk force-pushed the pypy3-test branch 3 times, most recently from 783882d to 6b04978 Compare August 9, 2021 08:23

pexpect-style wait for output in test_kernelapp

c0f12fb

wait for the expected output to appear, instead of assuming 1 second is enough this should be both faster (where sleep(1) was enough) and more reliable

minrk force-pushed the pypy3-test branch from 6b04978 to c0f12fb Compare August 9, 2021 08:49

use Python for signal kernel subprocesses

fda489a

PyPy doesn't create subprocesses in the same way as CPython, resulting in ignored signals when we try to interrupt with `killpg`

change wait condition in poll

321094a

wait for processes to exit, not for expected result otherwise, a timeout is forced for any incorrect result

minrk added 2 commits August 9, 2021 12:58

local provisioner: ensure pipes are cleaned up using Popen.__exit__

2be2332

missing stop_channels in parallel tests

9a28aea

minrk force-pushed the pypy3-test branch 3 times, most recently from db4ee2d to 4d433eb Compare August 9, 2021 12:22

minrk added 2 commits August 9, 2021 15:06

check for leftover zmq resources across tests

5c131da

add timeout to test job

efcab39

so hangs don't run for hours pypy needs more than 10 minutes, cpython doesn't

minrk force-pushed the pypy3-test branch from 4d433eb to efcab39 Compare August 9, 2021 13:06

blink1073 closed this Mar 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add pypy-3.7 to test matrix #663

add pypy-3.7 to test matrix #663

Uh oh!

bollwyvl commented Jun 30, 2021

Uh oh!

minrk commented Aug 9, 2021 •

edited

Loading

Uh oh!

mattip commented Aug 9, 2021

Uh oh!

minrk commented Aug 9, 2021

Uh oh!

mattip commented Aug 9, 2021

Uh oh!

minrk commented Aug 9, 2021

Uh oh!

bollwyvl commented Aug 9, 2021

Uh oh!

Julian commented Aug 9, 2021

Uh oh!

bollwyvl commented Aug 9, 2021

Uh oh!

blink1073 commented Mar 28, 2022

Uh oh!

mattip commented Mar 29, 2022

Uh oh!

blink1073 commented Mar 29, 2022

Uh oh!

Uh oh!

add pypy-3.7 to test matrix #663

add pypy-3.7 to test matrix #663

Uh oh!

Conversation

bollwyvl commented Jun 30, 2021

Uh oh!

minrk commented Aug 9, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattip commented Aug 9, 2021

Uh oh!

minrk commented Aug 9, 2021

Uh oh!

mattip commented Aug 9, 2021

Uh oh!

minrk commented Aug 9, 2021

Uh oh!

bollwyvl commented Aug 9, 2021

Uh oh!

Julian commented Aug 9, 2021

Uh oh!

bollwyvl commented Aug 9, 2021

Uh oh!

blink1073 commented Mar 28, 2022

Uh oh!

mattip commented Mar 29, 2022

Uh oh!

blink1073 commented Mar 29, 2022

Uh oh!

Uh oh!

minrk commented Aug 9, 2021 •

edited

Loading