- 
                Notifications
    You must be signed in to change notification settings 
- Fork 8k
[PoC] Limited Abstract Generics #18260
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
| You are almost to full generics here :) ... Invariant is a good default (and usually the default for any generics). You only need a proper constraint resolver (this is partly why I was working on type trees in #18189, which would let you resolve covariant/contravariant constraints very easily). I'm working on that for zend_type the last couple of weeks -- which is far more complex. Potentially, between the two of us, we could enable something powerful here. I don't know if your intent is to get to full generics from here, but this is pretty similar to a couple of other experiments I've done. | 
| I was not really planning on going full generics. As the main issue with them from my understanding is determining the type to be bound to at runtime in a way that is not terrible for ergonomics and performance. Maybe @arnaud-lb could shed a bit more light. I didn't even think of a constraint resolver, but a few other people mentioned it and have an idea how to implement it, so will do that soon. | 
f72c2e6    to
    2baed8a      
    Compare
  
    | Interesting! In term of functionalities that has some similarities with @nikic's "purely abstract" generics [1] as well as @derickr Collections [2], in that we can not parameterize types at the point of use, but types can extend/implement parameterized types. One implication is that we can not use a type-with-assoc-types in type declarations, because this is not allowed: function f(I<T: Foo> $i) {}and this is unsound if  function g(I $i) {}Therefore, currently this seems most useful in traits and abstract classes? Could you expand on the relation with the Container/Offset RFC? Allowing assoc types in traits or abstract classes seems possible, but this increases complexity to a level comparable to [1], as assoc types on properties, method signatures, or method bodies would be handled at runtime (at least on abstract classes). Allowing  interface I {
    type T; // invariant by default
    function foo(T): T;
}here as well: interface J {
    type T; // invariant by default (covariant would be allowed)
    function foo(): T;
}but covariant here: interface K {
    type out T; // covariant
    function foo(): T;
}
 I confirm. There are some difficulties [3]: 
 [1] PHPGenerics/php-generics-rfc#45 | 
| Could you explain the unsoundness argument a bit more? I am struggling to see it. This experiment was mainly prompted about the discussion of allowing  <?php
interface I {
    public function set(never $offset, never $value);
    public function get(never $offset): mixed;
}With the intention that any implementation of said interface would specialize the types to be "sensible" e.g. <?php
class ListOfAnimals implements I {
    public function set(int $offset, Animal $value);
    public function get(int $offset): Animal;
}The proposal to allow  However, an associated type, even without being able to specify it in a type declaration, gives you at least the small guarantee that different methods use the same types: interface I {
    type K : int|string
    type V : mixed;
    public function set(K $offset, V $value);
    public function get(K $offset): V;
}
class ListOfAnimals implements I {
    public function set(int $offset, Animal $value);
    public function get(int $offset): Animal;
}This is basically also how it ties in to the Container/Offset RFC, because instead of needing to use  <?php
interface DimensionReadable
{
    public function offsetGet(mixed $offset): mixed;
 
    public function offsetExists(mixed $offset): bool;
}
 
interface DimensionFetchable extends DimensionReadable
{
    public function &offsetFetch(mixed $offset): mixed;
}
 
interface DimensionWritable
{
    public function offsetSet(mixed $offset, mixed $value): void;
}
 
interface DimensionUnsetable
{
    public function offsetUnset(mixed $offset): void;
}
 
interface Appendable
{
    public function append(mixed $value): void;
}
 
interface FetchAppendable extends Appendable
{
    public function &fetchAppend(): mixed;
}We could use a pair of associated type: <?php
interface DimensionReadable
{
    type K;
    type V;
    public function offsetGet(K $offset): V;
 
    public function offsetExists(K $offset): bool;
}
 
interface DimensionFetchable extends DimensionReadable
{
    public function &offsetFetch(K $offset): V;
}
 
interface DimensionWritable
{
    type K;
    type V;
    public function offsetSet(K $offset, V $value): void;
}
 
interface DimensionUnsetable
{
    type K;
    public function offsetUnset(K $offset): void;
}
 
interface Appendable
{
    type V;
    public function append(V $value): void;
}
 
interface FetchAppendable extends Appendable
{
    public function &fetchAppend(): V;
}Where  My main concern with supporting traits, is that I would be hitting the same issue, that I haven't resolved yet, when trying to resolve  I will also say that for this feature to be fully fleshed it does need to support property hooks, which might or might not be a challenge. | 
| 
 What I meant is that calling any method in  Thank you for the explanations. | 
| 
 I have some ideas here. Here's one I'd probably tackle first as a proof-of-concept: 
 If the type is affirmed, then the type in the zval can be inferred, otherwise, it is an error. function foo(SomeInterface $a) {
  new GenericArray($a); // type error: type cannot be inferred from SomeConcreteType
}
foo(new SomeConcreteType());It's not ideal, but it is pretty straightforward to reason about as a user. 
 I'm working on this, but I lack a lot of practical knowledge of the engine -- but getting there, slowly but surely. Feel free to beat me to it. | 
2baed8a    to
    cc039c9      
    Compare
  
    | @withinboredom this is an interesting idea as it makes inference works when the runtime and static types match. Unfortunately I think it’s unsound because calling foo() with a type accepted by its signature is an error. | 
cc039c9    to
    fd86e50      
    Compare
  
    14b6bb7    to
    4b8cb7e      
    Compare
  
    | --EXPECTF-- | ||
| Fatal error: Generic type cannot be part of a union type in %s on line %d | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When I saw my example in here, I got excited that you added basic union support. Nope! 😆
One day 🙏🏻
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As I said to Bob I really want to keep it as small as possible as it's already hurting my brain a bit! But this should be a rather easy limitation to lift :)
c5cedbd    to
    3a8b3d2      
    Compare
  
    3485776    to
    1679e6d      
    Compare
  
    | /* Check whether any type in the fe_type intersection type is a subtype of the proto class. */ | ||
| static inheritance_status zend_is_intersection_subtype_of_class( | ||
| zend_class_entry *fe_scope, const zend_type fe_type, | ||
| zend_class_entry *fe_scope, const zend_type *fe_type_ptr, | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is the reason for passing all types as pointers?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
At the beginning when using the Associated Type syntax I was storing a pointer in the bound_types field as the type would have been stored in the arg_info of the first bound function. This doesn't really apply any more. Will revert the first commit.
Have a HT null access somewhere...
Use the index that is now part of the zend_type This should reduce memory consumption as we are not doing a second deep copy of the types Fixes one of the implicit interface tests
This is a proof of concept for a limited abstract generic types feature set, as those can be, and are, resolved at compile/linking time.
Implementation
Depends on:
The implementation is relatively dumb, and partially based on arnaud-lb#4 for parser/AST/compile shenanigans.
The generic types (name and constraint) are stored on the CE in a new
generic_parameterslist field.The bound types are also stored on the CE as a HashTable:
This means that this implementation cannot be extended to support concrete generics (i.e. generics on a concrete instantiable class), as those need to be tied to the instance of the CE, not the CE itself.
The generic types must be:
It is possible to
extendan interface with generic types, so that a sub-interface can reuse the same generic parameter.If one of the generic parameters of the interface being extended has a type constraint, this type constraint must be repeated on the child interface. As the type constraints for interfaces are checked when extending.
ToDos
Benefits
Although the lack of type declarations can make this unsound, in that a generic type
Tof an interfaceI<T : C>is no better than the type constraintC(which ismixed) by default.It does "solve" the primary need of wanting
neverto be useable for parameter types, being able to specify the actual type, and thus have engine type checking, on the concrete implementation. As this is currently prevented by LSP variance rules.One use case would be to use generic types
K, Von the new interfaces of my Container/Offset RFC instead ofmixed.Future scopes
T|nullvalid)