Added Standard methods to Ontology #246

francescalb · 2021-09-29T20:58:23Z

Description:

Closes #228.

hash (based on base_iri) defined for ontology.Ontology (only req to allow for eq implementation)

eq (based on all triples) defined for ontology.Ontology. Note that consistency between blank nodes is not checked.

This is tested in test_load_emmo.py.
The test also checks that two equal emmo-ontologies are no longer equal when a new class or new relation is added to one of them.

Type of change:

Bug fix.
New feature.
Documentation update.

Checklist:

This checklist can be used as a help for the reviewer.

Is the code easy to read and understand?
Are comments for humans to read, not computers to disregard?
Does a new feature has an accompanying new test (in the CI or unit testing schemes)?
Has the documentation been updated as necessary?
Does this close the issue?
Is the change limited to the issue?
Are errors handled for all outcomes?
Does the new feature provide new restrictions on dependencies, and if so is this documented?

Comments:

ontopy/ontology.py

ontopy/patch.py

It now works, but only when the test includes checking for classes and base_iri of the ontology. The test does not include properties.

Preffered is hash on iri, second, try prefLabel, third try name fourth try label

everything except annotation_properties and Individuals

Note that consistency of blank nodes is not checked.

CasperWA

Looking great. I have only some minor code optimizations and a more significant question concerning the _sorted_entities() method.

ontopy/ontology.py

tests/test_load_emmo.py

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

ontopy/ontology.py

francescalb · 2021-11-01T13:13:53Z

Why? I guess changing s to subject, p to predicate and o to obj can be done, but spo is quite standard when looking at triples I think.

Then, doing the _unabreviate in the yield, do wish for that because it is faster or just because you think it is prettier? If it is faster I agree, but otherwise I am not a super fan of compacting code all the time, as it is more difficult to read for new people coming in I think.

CasperWA · 2021-11-01T13:25:52Z

Why? I guess changing s to subject, p to predicate and o to obj can be done, but spo is quite standard when looking at triples I think.

Single character variables is very bad practice when collaborating on code.
While it may be "standard" to use s, p, and o as one character variable names for subject, predicate and object in ontologies, it is extremely opaque for a non-initiated. Writing the variables out as subject, predicate, obj (not object, because that is a built-in variable), makes it much more transparent what we are dealing with.
At the very least, to make the variables similar, one could call them sub, pre, obj, however, even that becomes unclear, as "sub" is merely a prefix for anything, as is "pre". It could be anything.

In general, using single character variables is a leftover bad practice from programmers of compiled languages, where one wants to compact as much as possible, while it's uncommon multiple people get to work with the source code who's not deeply initiated in everything about the code base. For open code projects and general readability one should strive to be more explicit and forgiving with the number of characters used.

As a note, this is something I've gone through and changed in the whole of the code base in #245 as well.

Then, doing the _unabreviate in the yield, do wish for that because it is faster or just because you think it is prettier? If it is faster I agree, but otherwise I am not a super fan of compacting code all the time, as it is more difficult to read for new people coming in I think.

I am generally against creating unnecessary variables.
The definitions of s, p, and o originally, overwriting the iterating variables is bad practice - one shouldn't change these in the loop. This both avoids doing that as well as simplifying the code.
One could make it slightly more explicit perhaps here by splitting it on several lines, like such:

return (
    _unabbreviate(s),
    _unabbreviate(p),
    _unabbreviate(o),
)

I think, in general you are correct to be a bit more explicit in the code in terms of middle-steps, however, in this case it's so simplistic there is no need for this, I should think?

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

francescalb · 2021-11-01T18:36:26Z

OK, fixed.

jesper-friis · 2021-11-01T21:46:05Z

Took a look at the conversation above. I am not sure I fully agree that using one-letter is always bad practice. In general I agree that descriptive variable names are good, but sensible use one-letter variables makes the the lines shorter and the code easier to read. If you just need an integer variable in a loop, I think that i or n are completely fine to use. The same is the case with s, p and o in triples. In contrary I am not much fan of _ as a variable you actually use, but if you only want to loop over the objects in a set of triples I think it is fine to write for _, _, o in triples: ....

CasperWA · 2021-11-02T07:55:28Z

Took a look at the conversation above. I am not sure I fully agree that using one-letter is always bad practice. In general I agree that descriptive variable names are good, but sensible use one-letter variables makes the the lines shorter and the code easier to read. If you just need an integer variable in a loop, I think that i or n are completely fine to use. The same is the case with s, p and o in triples. In contrary I am not much fan of _ as a variable you actually use, but if you only want to loop over the objects in a set of triples I think it is fine to write for _, _, o in triples: ....

The underscore is the accepted Pythonic way of defining un-used variables, specifically in the example you show above, but also in other cases, e.g., when a function returns a Triple type and all you need are one of those.

I think exactly because they are single-character variables they are not easy to read, quite the contrary, it makes it impenetrable to understand, especially when the method or function names do not mention triples at all.
To align the variable naming a bit more, one can write out the "object" variable name, but append an underscore, as this is the way to avoid renaming built-ins according to PEP-8, i.e.: for subject, predicate, object_ in get_me_some_triples(): ....

I agree that single-character variables can be used as iterating variables. This is common practice across several programming languages - but for ease of readability I'd like to strongly support more descriptive variable names throughout the code base.

CasperWA

This looks excellent to me now :) Thank you @francescalb !

CasperWA · 2021-11-02T07:58:27Z

Everything said - we can always introduce our own standards for some variable naming, but I think it should be documented well then so there is a resource to go to if one is looking at this as a coder, but without an understanding of the content that's being worked with.

CasperWA · 2021-11-02T12:02:49Z

This is all good for me. Let's discuss naming conventions perhaps in #245? As I'm doing a lot of different updates in the way of naming in that PR.

CasperWA · 2021-11-02T12:18:41Z

Going through the code locally in my editor, I realize now you have overwritten the inherited get_triples() function. Was this intentional? If yes, perhaps it should reuse the same parameters and such? Either that, or the method should be renamed, e.g., to get_unabbreviated_triples() or something?

CasperWA · 2021-11-02T12:23:52Z

The latest commit from #245 includes my suggested change/fix for this.

francescalb · 2021-11-02T12:31:56Z

You are right. That is not good at all. Your fix should be included.
Thanx!

CasperWA · 2021-11-02T12:33:18Z

Great - I'll make an extra issue and PR for it and redact my other PR.

francescalb changed the title ~~Flb/close 228 ontology stadand magic methods~~ Added Standard methods to Ontology Sep 30, 2021

CasperWA reviewed Oct 7, 2021

View reviewed changes

ontopy/ontology.py Outdated Show resolved Hide resolved

CasperWA reviewed Oct 7, 2021

View reviewed changes

ontopy/patch.py Outdated Show resolved Hide resolved

francescalb added 8 commits October 25, 2021 12:37

Added __hash__ and __eq__ to ontology.Ontology

7e2499c

Sorting emmo classes with iris does not work. checkout

7e6ab17

Added test for ontology equality

180a05a

Creating hash and testing ontology equality

475683d

It now works, but only when the test includes checking for classes and base_iri of the ontology. The test does not include properties.

Extended possibilities for ThingClass hash

3b6585e

Preffered is hash on iri, second, try prefLabel, third try name fourth try label

Only hash on iri for ThingClass in hash, and ontology hash including

d2ad916

everything except annotation_properties and Individuals

Sorting emmo classes with iris does not work.

63513fe

Only use iri for calculating the hash

3fa0e52

CasperWA force-pushed the flb/close-228-Ontology_stadand_magic_methods branch from 08cc67d to 3fa0e52 Compare October 25, 2021 11:57

francescalb added 2 commits October 25, 2021 22:56

Added __eq__ in Ontology based on triples

4b08fa6

Note that consistency of blank nodes is not checked.

Added testing of new relation

61011d5

francescalb marked this pull request as ready for review October 25, 2021 21:07

francescalb requested a review from CasperWA October 25, 2021 21:07

CasperWA requested changes Nov 1, 2021

View reviewed changes

francescalb and others added 4 commits November 1, 2021 13:32

Name of inferred ontology not shortened any longer

b7371d0

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

Removed helper function to sort entities as it is no longer needed

14ff1c2

Improved on return of dummy variables

b8e700d

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

Clarify documentation of __eq__ for Ontology

1da61a5

CasperWA requested changes Nov 1, 2021

View reviewed changes

ontopy/ontology.py Outdated Show resolved Hide resolved

write out short names and skip unnecessary intermediate variables

76edb29

Co-authored-by: Casper Welzel Andersen <43357585+CasperWA@users.noreply.github.com>

francescalb requested a review from CasperWA November 1, 2021 18:36

CasperWA approved these changes Nov 2, 2021

View reviewed changes

Too long line with long variable names fixed

1ea3bc9

francescalb merged commit ffdecd8 into master Nov 2, 2021

francescalb deleted the flb/close-228-Ontology_stadand_magic_methods branch November 2, 2021 12:04

CasperWA mentioned this pull request Nov 2, 2021

Overwriting get_triples() method #280

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added Standard methods to Ontology #246

Added Standard methods to Ontology #246

francescalb commented Sep 29, 2021 •

edited by jesper-friis

Loading

CasperWA left a comment

francescalb commented Nov 1, 2021

CasperWA commented Nov 1, 2021

francescalb commented Nov 1, 2021

jesper-friis commented Nov 1, 2021

CasperWA commented Nov 2, 2021

CasperWA left a comment

CasperWA commented Nov 2, 2021

CasperWA commented Nov 2, 2021

CasperWA commented Nov 2, 2021

CasperWA commented Nov 2, 2021

francescalb commented Nov 2, 2021

CasperWA commented Nov 2, 2021

Added Standard methods to Ontology #246

Added Standard methods to Ontology #246

Conversation

francescalb commented Sep 29, 2021 • edited by jesper-friis Loading

Description:

Type of change:

Checklist:

Comments:

CasperWA left a comment

Choose a reason for hiding this comment

francescalb commented Nov 1, 2021

CasperWA commented Nov 1, 2021

francescalb commented Nov 1, 2021

jesper-friis commented Nov 1, 2021

CasperWA commented Nov 2, 2021

CasperWA left a comment

Choose a reason for hiding this comment

CasperWA commented Nov 2, 2021

CasperWA commented Nov 2, 2021

CasperWA commented Nov 2, 2021

CasperWA commented Nov 2, 2021

francescalb commented Nov 2, 2021

CasperWA commented Nov 2, 2021

francescalb commented Sep 29, 2021 •

edited by jesper-friis

Loading