-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OCaml's functors vs. typeclasses #22
Comments
Actually no, I'm referring to something simpler: Consider a TotalOrder type class. So Integer is totally ordered right? Which way? Up or down? You see? In Ocaml it is no problem because you have module IntegerUpOrder = ... but in Felix or Haskell you're screwed. You only get ONE bite at the cherry. Type classes are strictly weaker. They fix the semantics of some notation permanently and irrevocably. Another example from Felix which is very annoying: I had to define AdditiveGroup and MultiplicativeGroup. TWO type classes. One uses + and one uses * operator. But they're both groups with the same structure. In Ocaml, no problem. One functor. Two instances. Because the instances are nominally typed. Of course the nominal typing is a serious PITA which makes Ocaml functors really hard to use compared to say C++ templates or Felix type classes. Roughly, a type class is a nominal type which provides structural typing for instances. BTW: both mechanisms are pretty bad in the sense that neither supports composition very well. You just try to define a Ring out of two Groups in Ocaml! How does maths do it? With context sensitive language. That is, DSSLs. You can't read any maths papers without first understanding the symbolic language used in that domain, and, often, in that particular paper! |
@skaller wrote:
I had written about this before (citing Robert Harper from CMU) that in Haskell a typeclass can only be implemented in one way, but afaik this is because of Haskell's global type inference. Yet with local selection of implementations, then it is possible to have multiple variants of implementation for a typeclass for a particular data type. Are you referring to something else because I don't see a limitation being inherent to typeclasses or how OCaml has improved this?
Why not define a Group typeclass that takes a type parameter for its group operation?
I'm too lazy to try to map out what you mean. Could you elaborate a bit or show example? |
FYI, AFAIK Haskell doesn't have type inference at all (it has kind inference though). In any case even if the instance bindings are local, you still have s problem if you want two distinct bindings for the same types in the same context. The reason I can't define a Group typeclass that takes a type parameter for the group operation is that types are not functions. The group operation is a function. Felix typeclasses only allow type parameters, not function or value parameters. Of course I could add function parameters. But guess what? That's just an Ocaml functor which takes a module argument, where modules include both types AND functions. As to composition, its well understood that neither type classes nor modules are easy to combine. I once asked on the Ocaml list how to make a Ring out of an AddGroup and a MulGroup and it took the foremost experts a couple of days to get it right. The reason is that the types of each are distinct as written and so to make them the same type requires tricky syntax, which turns out to be ad-hoc and limited in what it can do. In Felix (and I guess Haskell) you can derive a type class from one or more others, but such derivation is not combinatorial. What we want is combinators to systematically add features together and this is really only possible for type classes that have a single type, and functions that accept such type but return an invariant type. In other words, its just OO at the compiler level, with the same limitations (covariance problem). In principle, we "know" how to combine things because category theory tells us. In practice the modelling is so complex no one can understand it. That's basically why people use dynamics, because the correct static modelling is just too hard. So actually, the way to do this is part of the design goals of Zen: the compiler's type system has to be a full scale purely functional programming languages. That doesn't tell you how to implement the required combinators but it does at least ensure that when you figure it out, you have the tools to do it. |
@skaller wrote:
Ah I guess we just used different terminology where you mean group types aren't inferred by type constructors are. I would need to think that out a bit...
Ah so overloading, but how do you select the desired instance within the context? Do you annotate somehow or is the proper overload inferred from context somehow? Excuse my myopia.
Seems we are back full circle to the point of what I wrote 6 days ago (as linked from my OP).
A code example might help me understand better. |
@shelby3 wrote:
I think what you mean is that you don't want to subsume the implementations of the typeclass to the typeclass, but rather track the intersection type of the two available implementations and then monomorphize that statically at the call site based on which implementation the call site requires? So I think a typeclass that takes the group operation as a type parameter suffices, as the function definition can require a specific group operation instance for the typeclass. The key insight appears to be my solution to the Expression Problem so you don't lose track of the type, so this delayed binding to instance is possible. |
"Ah so overloading, but how do you select the desired instance within the context? Do you annotate somehow or is the proper overload inferred from context somehow?" You're hitting the point backwards. The point is you CANNOT select the proper instance. Just think of i < j, where you have Up integer order and Down integer order. How can you possibly choose? In Ocaml you choose because functor applications have nominal types: you have to name them: UpInteger.less (i,j) or DownInteger.less (i,j). Since these are modules you can also do: open UpInteger to avoid explicit qualification everywhere. In Haskell and Felix you CANNOT do that because the instance doesn't have a name. It is selected entirely by the type. A typical work around is like this (in Felix code): union DownInt = Down of int; and now .. Down i < Down j .. to get the unusual reversed total ordering. The problem is, you cannot add two DownInt. You have to define addition (in Felix code again): fun + (x: DownInt, y: DownInt) => match x,y with So really, DownInt's are not integers, until you equip them with all the usual operations. Usually you don't, you just use the Down constructor where you need it to force a distinct typeclass instance but you cannot do that if you are trying to use an existing algorithm that requires Down ordering and 20 other functions defined on ordinary integers but not downwards ones. So the workaround is only a local workaround. the only real solution is to define another function such as "reverse_less" in the total order type class and then you have to parametrise ALL your algorithms that require a total order to say which order to use. |
The answer is type classes are used to model algebras. Order is an operation, so you pass a relation as an argument. There 'ralation' would be a type class. See Stepanov's "Elements of Programming" it's all explained in there. |
Sorry I don't have it (Stepanov) and it isn't online so I can't read it (and I'm not rich enough to buy a book with examples in C++). But you have missed the point. With Haskell (or Felix) type classes you can have an algorithm and pass the ordering in as a type class, but you can only have ONE ordering for a given type such as integer. What I explained was that there is a local workaround where you use instead a variant "Down of int" which is of a distinct type so you can construct a different order for it and pass that in. Unfortunately, you can't add those integers. So you have to make a complete copy of the whole model of an integer for Down integers. Which is entirely nontrivial and in fact probably completely impossible if you have foreign libraries which use integers directly instead of using a type class. The simple fact is most uses of integers are using integers, not a type class based model of integers with instances other than integer. Unfortunately many types have more or less unique properties and modelling everything with type classes is going too far. In fact one could argue that types are so specialised that to use classes, be they OO classes or type classes, is a waste of time since to get a refined enough view, you would only have one function per class. At that level, composition is isomorphic to a soup of functions: you gain no benefit because you have not abstracted anything. |
@skaller I have difficulty understanding your explanation. I am unable to be sure if I can guess what you are describing in words, and a complete example might be helpful. My best guess is you want to compose over functions that employ the One solution appears to be what @keean suggests of having all such functions manually input the operation function separately from the data type which is being input. But I am thinking I don't agree @keean that this can't and shouldn't be done with typeclasses. Edit: I see @skaller has made a similar point:
Another way appears to be to select into scope only the implementation instance of the So I guess I am failing to see why the intended goal can't be handled with typeclasses? |
OK, sorry. Here is a simple example, from the Felix library:
That sorts a list. Now consider something that builds, say, a N-M search tree or something really complicated and messy. A Zen compiler maybe .. :) Now what can you do if the order is the reverse of what you need? You cannot provide a second type class using the other order for the same type. You're only allowed one, so you can only sort in ascending order. The problem is you wrote the code making assumptions, and didn't explicitly pass in the ordering function. You use a type class instead and now you're very sorry! Any single example I give will admit a modification, but the point here is that if you wrote the nasty complicated algorithm using an Ocaml module, you could reverse the order without changing the nasty complicated algorithm. You can't fix it with type classes, you have to either BYPASS type classes to fix it, or, you have to construct a new type, an integer with the usual ordering reversed, and use that. The latter solution is not workable in more general cases, because you often have simple data types modelling complicated things. The problem is that tying the algebra to the types is just wrong. Algebras have to be modelled entirely with functional relations using categories (modules in Ocaml) and various constructions including functors (functors in Ocaml). Type classes just don't work. Basing structure (algebra) on types is wrong. Technically, two types have ONLY one property of any significance: they're either the same type or not. That is the end of it. Types have no other utility than providing identity. Just for example consider a product like int * string, value (42, "Gitchiker"). you may think this type has structure. WRONG. A product is NOT a type. Its a pair of projections! What I mean is that a product is characterised by the projections. The actual type has no utility other than the fact it is different to other types. That difference determines which projections can act on it. Its the projections that have the universal properties that define what a product is. So again: type classes are wrong. Because they base a fixed set of related functions on some specific types, as if the types have some significance. But this doesn't mean not to put them in Zen. I have them in Felix. Haskell has them. |
@skaller wrote:
Here is where you lose me. Who is 'you' in the code? The caller of |
@skaller wrote:
Afaics, you are not factoring in that the selection of the implementation of a typeclass is not solely determined on the data type being implemented for. As I pointed out before, it is also possible to control the selection at the call site (although apparently not for Haskell). How else could you possibly do it other than to select a set of operations to pass into the call? What does OCaml do which is better than that? |
@skaller wrote:
Having written several large programs using both module and typeclass systems, I feel able to comment: Modules produce a proliferation of boilerplate. In effect if you parameterise a function on modules you have to provide all the modules explicitly. Modules have type parameters, and you rapidly run into the situation where you need to provide the same type parameter to several modules. When you do this you introduce redundancy into the signature, that invites mistakes at worst, or is needless repetition at best. The advantage of type classes is precisely that they are implicit. When I call '+' on a type, the type provides all the information to know what implementation of plus to choose. We can correctly have a type class for '>' or '<'. So I am not sure what your problem with type classes is? Sorting is an algorithm, not an algebraic operator. You have this very problem in English, if I say "sort this class of people by height" what order will I get them back in? We can only say something is in a ordering, or partial ordering with respect to a particular relation operator. |
Actually he is right. Type classes select which implemenration to run purely on the type of the arguments (not their values) hence the name type-classes. Different implementations can get despatched from the call site only because the types of the arguments are different. |
I agree with your analysis: type classes are weaker but easier to use. The problem is simple, for a class X there is only one instance for T = A. You cannot pass a function parameter to a type class, the index is exclusively a (list of) type(s). Ocaml functors allow you to also pass functions as parameters. The theory is simple in principle: a functor is a mapping which takes a subcategory of TYPE as an argument and produces a new sub-category. Anything else is weaker. Type classes are therefore weaker because they only take types to construct a new subcategory. The arrows are missing as parameters. Its a tradeoff between ease of use and power. Functors are correct, because category theory is correct. It is the only known working model of abstraction and structure. Exactly how to represent them in a programming language is another issue though :) TYPE is the name I give to the category of types and functions, Haskell calls it asterisk. This is the category we write code for. Denotational semantics is nothing but a functor from TYPE to SET, the category of sets and functions. In Felix you can actually define functors: typedef fun swap (x:TYPE, y:TYPE): TYPE => y,x; which is a bifunctor from TYPE * TYPE to TYPE. The effect on the functions is left out, but can be captured with the Functor type class which has method "fmap" in Haskell (it should be called map but they messed up by already defining it :) |
@keean wrote:
Disagree. I can control which implementation gets dispatched for each type by choosing which implementations I import into the scope of the call site. Which is what I already wrote two times in this thread.
Values? Are we asking for dependent typing? Why does he want to dispatch based on the values? Did you intend to write polymorphic type? |
@skaller wrote:
I request you please take the time write more coherently. I can't follow your descriptions. What the hell is Please take the time to explain so that readers can understand what you are talking about. From my perspective you are largely writing incomprehensible gibberish. Perhaps @keean can read between the lines, but I can't. Please be more explicit and develop your descriptions with complete code examples. |
Shelby, how do you "control which implementations get imported into the call site"? You cannot do that in Haskell. You cannot do that in Felix. And you cannot do it (legally) in C++ either. |
Re T=A, I'm sorry, I didn't proof read, and the idiot markup crud removed stuff. It should have read
meaning the instance of X with T set to concrete type A. The less and greater signs got gobbled. |
@skaller wrote:
Only what you import is visible to the compiler in the file or module you are compiling. C++ has an include. |
This is true if overlapping instances are allowed. Normally instance sets are required to be coherent, so only one instance globally can match one type, this means changing what you import cannot change the meaning of the program. We probably want to warn if instances are incoherent. |
@keean we can't enforce globally and have true Git-style modularity. I explained this to you before and apparently you forgot already, because this concept is apparently foreign to the way you conceptualize. You keep thinking that you can have a runtime loader that enforces some global consistency, but that is the antithesis of decentralized development. You keep forgetting the Second Law of Thermodynamics. Just abandon global consistency and global soundness, as it does not exist. For if such total orders could exist, then we could not exist because the speed-of-light would not be quantifiable. Why the heck would you want to make all instances of implementations of a data type the same? A per call-site decision isn't a problem afaics. For our TypeScript transpile hack we need to assume they are, but after we write our own JavaScript compiler, I don't see why they need to be coherent. EDIT Jan. 2018: A linker can insure a total ordering of implementing typeclasses only one way for each data type. My thought above was for hot pluggable code over the unbounded Internet, which require some sort of cryptographic hash id in order to enforce said total ordering in a decentralized context. |
@shelby3 instances need to be coherent to avoid non-local definitions changing local behaviour. If instances do not have global coherence then importing a module or runtime loading a DLL could change the behaviour of the program in unexpected ways. You need global coherence of instances so that you can reason locally about code, without remote changes affecting it. |
@skaller The point of type-classes is to enable generic programming. When we write a function we want to be able to set various requirements on the types passed in. For example:
So here we can see the copy should be passed two The things we are passing to the function are Iterator objects, and we want to constrain the types to have certain properties. So you can understand type-classes as constraints on types. I don't really see the relevance of your criticisms of type classes. |
Ok, so in Felix, (1) each file is separately parsed, then the parse trees are concatenated based on include directives. Then we have During monomorphisation polymorphic bindings to type class functions are mapped to instance functions. An error is possible if the instance function does not exist. Instances can be defined anywhere at all. They do not have to be visible from anywhere at all. The typeclass must be visible to the instance. Felix allows overlapping instance definitions, and they can be polymorphic. If there are several instances definitions providing an instance definition of a type class function, then the one defined in the most specialised typeclass is selected if it exists, otherwise it is an error. Note that in Felix an instance does not have to provide all of the type class functions, this is not an error unless you try to use one you didn't define. Thus in Felix, if you include a file F containing an instance for a type class, Felix uses that, if instead you include G containing a different instance, Felix uses that, and if you include both it choses the most specialised. However file inclusion in Felix is transitive, has no impact on parsing, and is independent of the location of the include directive: all AST's of included files are concatenated. If you read that carefully you will see you cannot include a file in any scope other than the global scope, and inclusion can impact both binding and choice of type class instance, but I consider this a design fault in the system, precisely because it allows what is called hijacking, of both ordinary overloading, and also of the kind of overloading type classes provide. In Felix, type classes do not cause any run time overhead. In Haskell, a function accepts a dictionary containing an instance at run time. This is necessary because Haskell has separate compilation. Haskell, however, does not allow overlapping instance definitions. Thus, type classes introduce a performance overhead in Haskell. In Fortress, the choice of the most specialised method is done the same way as in Felix I think, however the choice is made at run time (late binding). Its a very sophisticated OO system with multi-methods based on run time type class stuff. In Ocaml, module instances are always visible at the point of use but abstraction introduces indirection and a performance overhead (unless the definition happens to be local or the trans-compilation unit inliner fixes it up). However Ocaml classes work differently, they perform run time (delayed) binding of methods. It uses lookup, but hash coding (like polymorphic variants) makes the lookup reasonably fast. |
@keean wrote:
Can't happen. The locality is in control of what it imports.
The locality can import the implementations it desires. If there is a conflict with other imports, the compiler should give an error.
I don't see why you think so. |
If you have a type class with a generic implementation (ie implements a function for a polymorphic type) and you use that in a module, and you import another module that does something useful (maybe written by someone else) that implements a more specific instance of the type-class for some type you are using, then the code will swap to using the new instance instead of the generic one. If the new instance is written in a different way it can change the behaviour. |
@skaller I am thinking what you want is to be able to pass in a data type and an interface to that data type orthogonally, and so that the binding of other implementations for other typeclasses is delayed and only the ones you have specified are selected early.
@keean I think we should consider adding this functionality combined with the ability to select different implementations for the same data type at the call site. It seems to be very powerful and will work with the unions concept I created to solve the Expression Problem. |
@keean Let me see if i understand what you mean by an output type: you would write something like
to say that V is a type label to use in a function signatures, but it is "dependent on I". Then you have to specify it in your instance:
Apart from my notation, is that the idea? The instance supplies, not only the functions, but it also supplies output types as well? Obviously, this is even more constrained than if the type V above were a parameter, but as we know type classes are even more constrained than modules and functors because you cannot index them on functions, whereas you can do that with modules/functors. So the additional constraint weakens the definition even further, but such weakening also has the advantage of reducing even more the amount of specification required to use them. i hope I understand. Felix doesn't have such dependent types and not having them means some constructions with type classes require manually adding the dependent type to the input list, which destroys the convenience of the notation. Do I understand correctly? |
@skaller wrote:
Yes the output type is functionally dependent on the input types. It is weaker than giving both as a parameter, but that is not what you want when programming. Given a collection of type 'x', I want to derive an iterator from the collection that iterates over the type of objects in the collection. Hence the iterator should determine the type of object. Its not just about ease of use, its about getting the algebra of collections and iterators to have the desired properties. |
Let me try to understand how the typing works. Consider first (Felix again sorry):
Since V is specified as a parameter, we can type check the assignment and the function return type. However this code has a serious problem in Felix! The procedure argument has type I: we cannot deduce the type V. So this use:
will fail because V cannot be deduced. We have to write instead
for example, to specify the value type (the iterator type is deduced). We have to do this because there is no constraint on the value type, it could be anything. Now clearly the dependent output type solves that problem, because you can now call f, and the iterator type I is deduced from the iterator argument. But what is the return type of f? That's a problem! It's not enough to know that it depends on I. So this is a real problem as I see it. We only know the type when monomorphisation selects the instance which then specifies the type. It's not good enough, because we bind the code polymorphically first. We can't type check. Felix does polymorphic type checking, instance binding happens later during monomorphisation. That is, the type checker just assumes the type class functions are parametrically polymorphic, even though this is a lie because only selected specialisations are actually defined in instances. I'm confused. I do not see how this can work. The problem is that the "functional dependence" of V on I is not accessible in the type class, that is the type function that calculates the type of V does not exist there: that function is the map from I to the type class instance settings of V for that I (for each concrete value of I). But when binding we do not consider instances. That happens later. The type checking is already complete. I have to know the type of every expression in Felix, because Felix has overloading. For any expression This is not true if you do not have overloading. You can defer the choice and use type inference to fill in the type variables later, because you only have one function |
@skaller wrote:
It seems to me we use types because it is a model in which we want to think when doing composition. We wish to think in terms of data, not in terms of relations of morphisms. Because it seems without types, we have a huge amorphous system of simultaneous equations of domains and codomains. Instead we create a model with the rigidity of types to provide comprehensible shapes so we may apply local reasoning. This has the disadvantage of reducing the degrees-of-freedom, whilst the advantages of approachable reasoning and feasible type inference. Category theory can only takes us so far, because humans can't program coherently in that highly abstract model of multiple orders of lifting. Sorry but if only 0.1% of the programmers can understand your code, then it is useless on any scale of mainstream open source (although might be acceptable for an open source niche such as mathematics). |
@skaller The functional dependence is on the type. If we have a collection of type
So the type of |
@skaller wrote:
Maybe I am misunderstanding, but this seems like a problem to me. If I have a collection of |
|
I feel I don't understand OCaml's functors well enough to understand what @skaller thinks we could do better? And I didn't get any feedback on my idea upthread. Could we please get some clear elucidation from you experts on whether we need some form of later or variable selectable binding of implementation to data type? I want to nail down the syntax and it seems the choice of syntax for typeclasses may depend on whether we need such a feature. |
@shelby3 Did you see my post about records providing some of the functionality, and if you use records, functors become ordinary functions (which is good in my opinion as why create a new category of functions that are somehow different from ordinary functions). Records have type parameters like modules. If records have associated types and private functions in the implementations (can easily be done by returning a record instance from a closure) we just need to make opaque types a global feature and I think we can do anything the OCaml module system can do with records. It would also remove the separate syntax for modules, as they would simply be records. And guess what, union types solve the module typing issue too, as the type of an associated type would be the union of all the different associated type definitions in the record values. I am not yet sure whether we can infer this type, or if you would have to list the possible types in a union in the record type. |
@keean I don't know what "associated type" is? Need to see code. I don't understand how records are applicable to late or selectable binding of implementations of typeclasses?
I have no idea what all that means. Could you translate it to a code example or otherwise explain it for a mere n00b like myself? Basically eveything you wrote is meaningless to me because I can't relate the terminology to anything concrete. |
@shelby3 An associated type is one where the type depends on the type parameters of the record, module or type class. consider:
The type of ValueType is completely determined by the type of This means we might do:
So what this means is that the record has a property called Regarding late binding, we can have a record like this:
We can define a value of this record type:
We can then pass this to a function as a first class value:
So we can pass in any record instance that provides functions that match the Record type, can then call them from within the function. |
@keean wrote:
Do you have a typo? Where is |
@keean wrote:
That binds interface type to data type which breaks any solution to the Expression Problem.
That does not provide late binding of selecting which one of multiple implementations of an interface we want for a particular data type. |
The function called is determined by the record passed at runtime, which is as late as you can get.
No the type of ValueType is defined in the implementation In the lambda cube this is a type dependent type. We want all corners of the lambda cube except types that depend on values. |
@keean wrote:
I know that the type unification is in the implementation. I meant where is |
@keean wrote:
The selection of the data type is late, but the binding of the interface to the data type is early. And I am repeating myself. Fundamentally you do not seem to maintain a conceptualization of the Expression Problem, and I often have to reexplain it. I will be making a long comment to explain all of this including what I think @skaller wants. |
This is true, but this is how modules and functors in OCaml work. I believe I understand what you have in mind for the expression problem. If we have a polymorphic function on type
and we call this from something where the type is unknown until runtime like:
Then although the type of However we will need run-time-type-information for the types in the union to implement the type-case (I know we are both aware of that, but I am putting it in for completeness). The above could work with modules, but you would have to pass a set of modules in that match the types in the union and do the type-case manually, which is lots of extra boilerplate, but you can do this already with the records we have. |
refresh |
@shelby3 wrote:
I realize the above is not @keean's preferred form. Any way that could be written instead (with a correction above as well):
I was trying to decide if I had to abandon the OOPish form of typeclass specification and I don't think so. |
But without types, then we do not have the foundational axioms for logic. I quote from Wikipedia:
|
@skaller wrote:
I'am with you with semantic specifications. |
@skaller wrote:
I wrote about this 6 days ago, and I think you are referring to the fact that OCaml's unique first-class functors recently have added implicit selection, so combined with modules this can apparently emulate typeclasses and more.
Could you explain about the limitations?
The text was updated successfully, but these errors were encountered: