Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tools are scattered #2

Open
dbooth-boston opened this issue Dec 7, 2018 · 24 comments
Open

Tools are scattered #2

dbooth-boston opened this issue Dec 7, 2018 · 24 comments
Labels
Category: tools For RDF tools primarily-tools Tool makers should address this website Central RDF website should address this

Comments

@dbooth-boston
Copy link
Collaborator

How to find them? Which to use?
Every team wastes time going through a similar research and
selection process.

"Beginners drown in the options."
https://lists.w3.org/Archives/Public/semantic-web/2018Nov/0229.html

IDEA: LAMP for RDF

Create a bundled release of RDF tools, analogous
to a standard LAMP stack, or Red Hat or Ubuntu; so that if
someone wants to use RDF all they have to do is install that
bundle and they're ready to go.

Richard Cyganiak called this the "LARP" stack:
http://richard.cyganiak.de/blog/2005/09/rdf-and-web-applications/

"such a platform already exists :) We call it LinkedDataHub":
https://linkeddatahub.com/docs/about
https://lists.w3.org/Archives/Public/semantic-web/2018Nov/0052.html

IDEA: semantic web generic client in a box

"Simply start a server pointing at a SPARQL endpoint(s). Optionally
configure it with some example queries, metadata about what property to
derive labels (or introspect this from the endpoint itself), and get:

  • A form interface for SPARQL queries with a menu of example queries
    that give you an idea of how to get started, plus some useful standard
    prefixes
  • Simple linked data browsing (using conneg for html vs turtle)
  • Dumb generic graphical browsing, a la neo4j
  • Basic schema derivation; or just stats on most used classes and
    properties
  • A generic semantic hypermedia API
  • An autogenerated domain-specific swagger API (e.g. using garlik)"
    https://lists.w3.org/Archives/Public/semantic-web/2018Nov/0046.html
@dbooth-boston dbooth-boston added the Category: tools For RDF tools label Dec 8, 2018
@azaroth42
Copy link

You lost the target audience with "Simply ... SPARQL endpoint". In other words, the first sentence.

@maximveksler
Copy link

Might consider producing a family of https://yeoman.io generators.

@maximveksler
Copy link

btw, regarding "best" - the awesome list concepts growing on GitHub over the years seems useful. I'm contributing to https://github.com/semantalytics/awesome-semantic-web for that reason.

@dbooth-boston dbooth-boston added website Central RDF website should address this primarily-tools Tool makers should address this labels Mar 11, 2019
@dbooth-boston
Copy link
Collaborator Author

I've taken a crack at starting a list of software that might become a bundled LAMP-like release of software that a new RDF user would need to start building "typical" RDF applications:
https://github.com/w3c/EasierRDF/blob/master/RDF-LAMP.md

It's very meager so far. Anyone have suggestions for additions or changes?

@TallTed
Copy link
Member

TallTed commented Sep 21, 2022

Your single listed data store (Blazegraph) is no longer (for some years now!) supported nor maintained, as it has been subsumed into Amazon Neptune.

At the risk of blowing my (employer's) own horn, I would suggest Virtuoso Open Source be on the list, as it is the engine behind the majority of nodes in the LOD Cloud.

I would also nominate the OpenLink Structured Data Sniffer which helps reveal and/or confirm the structured data, in islands or markup or otherwise, found in any given web page. Several other of our browser extensions might also be well included.

I might also suggest retitling RDF-LAMP to Ramp or RDF-amp, because, to start with, Linux is not (always) part of your stack, given you've mandated that this stack run on Windows, macOS, and Linux. The other letters in that acronym (Apache, mySQL, and PHP/Perl/Python) also deserve demotion, as these are not the key ingredients of any RDF-based solution; an "amp[lifier]", howeer....

@dbooth-boston
Copy link
Collaborator Author

Your single listed data store (Blazegraph) is no longer (for some years now!) supported nor maintained, as it has been subsumed into Amazon Neptune.

Ugh, thanks for catching that. I hadn't realized that they stopped supporting the open source version. That disqualifies it. :(

At the risk of blowing my (employer's) own horn, I would suggest Virtuoso Open Source be on the list, as it is the engine behind the majority of nodes in the LOD Cloud.

Sure, I'll list that at least for the moment. Thanks!

I would also nominate the OpenLink Structured Data Sniffer which helps reveal and/or confirm the structured data, in islands or markup or otherwise, found in any given web page.

Would you consider that similar to the RDF Browser Firefox plug-in that Andreas Harth mentioned today on the semantic-web list?

I might also suggest retitling RDF-LAMP to Ramp or RDF-amp . . . .

I agree that it should be renamed, but I'd like to wait a bit to see how things evolve and see what other name ideas emerge, to avoid renaming it multiple times.

@kurzum
Copy link

kurzum commented Sep 21, 2022

Fun fact: Virtuoso "Universal" Server came out some years after Apache. It is called Universal, because it is actually AMP, i.e. it has web server, relational database & triple store as well as Virtuoso PL, which is quite similar to PHP, syntax-wise.

The docu of Virtuoso is written for a professional audience, stuff like nginx, mariadb or mongodb or others are easy reads in comparison, maybe also because they can only do one thing and not many things. Also you can apt-get install it.

@TallTed
Copy link
Member

TallTed commented Sep 22, 2022

I would also nominate the OpenLink Structured Data Sniffer which helps reveal and/or confirm the structured data, in islands or markup or otherwise, found in any given web page.

Would you consider that similar to the RDF Browser Firefox plug-in that Andreas Harth mentioned today on the semantic-web list?

There's some similarity at a casual first glance, but I think there are significant differences.

First off, the RDF Browser plug-in is Firefox-only, while OSDS supports all major browsers including Chrome, Safari, Edge, Opera, and Firefox.

Second, I think relatedly, RDF Browser last pushed a new version in April 2021, and though there have been a couple of develop merges since, it appears to be rather stagnant. OSDS code maintenance hasn't been well disciplined, as a number of updated builds have shipped based on the develop branch without reaching the main branch, but that should be cleaned up shortly.

Third, and I think most significantly, the RDF Browser changes the HTTP Accept: header to include several extra media types, which may change the payload and/or media type of the server's response. In contrast, OSDS parses the "normal" payload returned to the browser, handling structured data found in POSH as well as islands (including entire documents) of Turtle, JSON, Microdata, RDFa, Atom, RSS, CSV, and others.

There are other differences, but I hope that's enough to entice you to install and explore a bit, and to add OSDS to the list.

@jmkeil
Copy link

jmkeil commented Sep 22, 2022

Your single listed data store (Blazegraph) is no longer (for some years now!) supported nor maintained, as it has been subsumed into Amazon Neptune.

Ugh, thanks for catching that. I hadn't realized that they stopped supporting the open source version. That disqualifies it. :(

At the risk of blowing my (employer's) own horn, I would suggest Virtuoso Open Source be on the list, as it is the engine behind the majority of nodes in the LOD Cloud.

Sure, I'll list that at least for the moment. Thanks!

To replace Blazegraph in Wikibase, Wikimedia developers recently did an evaluation of open source alternatives for Blazegraph.

@fekaputra
Copy link

fekaputra commented Sep 22, 2022 via email

@dbooth-boston
Copy link
Collaborator Author

dbooth-boston commented Sep 22, 2022

To replace Blazegraph in Wikibase, Wikimedia developers recently did an evaluation of open source alternatives for Blazegraph.

Awesome report, and very helpful! Their criteria are heavily weighted toward high performance, but also include several criteria that are more relevant to this effort:

  • Full SPARQL 1.1 capabilities
  • Active open-source community
  • Query plan explanation
  • Query without authentication
  • Support for named graphs (quads)
  • Query builder interface (ease of use)

Based on those criteria, I've added RDF4J and Jena as RDF database candidates also.

@dbooth-boston
Copy link
Collaborator Author

Good idea @fekaputra , I've added YASGUI and OpenRefine to the list. However, I did not find license information for the grefine-rdf-extension itself. I only found license information for software that the extension uses, such as Sesame, Any23, Lucene and Xerces. Can you please add appropriate FOSS license info for the extension itself? I created an issue for it.

And thanks @TallTed , I've added OSDS too.

@afs
Copy link
Contributor

afs commented Sep 22, 2022

Add Oxigraph https://github.com/oxigraph/oxigraph as an RDF triple store.

(I have no connection with the project).

@dbooth-boston
Copy link
Collaborator Author

Added, thanks

@jmkeil
Copy link

jmkeil commented Sep 23, 2022

Now there are four… I'm skeptical this serves the intended purpose.

From RDF-LAMP.md:

  • should represent the easiest and most popular community choice in its category.

I'm further skeptical that "most popular community choice" is a helpful criteria. There are tools with a long tradition, that the community got used to whose instability, bad usability, or limited standard compliance, but which will discourage beginners. Selection criteria should better focus on stability, usability and standard compliance, which is my understanding of "easy".

@dbooth-boston
Copy link
Collaborator Author

dbooth-boston commented Sep 23, 2022

Yes, one will have to be selected, but it seemed a little premature to do that yet. I'd like to get more suggestions on the table first, and refine our selection criteria.

I'm further skeptical that "most popular community choice" is a helpful criteria. . . .

Very good point. I've just reworded that criterion in response to your comment, and moved standards compliance to its own bullet. What do you think now?

@dbooth-boston
Copy link
Collaborator Author

Issue #96 suggests Docker as an easy way to provide a sample SPARQL database. Would Docker be a good way to provide an RDF-LAMP stack in general, or would it be too limiting? Opinions? Pros/cons?

@TallTed
Copy link
Member

TallTed commented Sep 23, 2022

Is "a good way to provide an RDF-LAMP stack in general" to be another (vocalized) popularity contest?

(Docker has its boosters, as do AWS, Azure, and other cloud providers. Each has its own pros and cons. I submit that whether the people watching this issue are fully versed in any, never mind some or all, alternatives is debatable at best, and that most of us will have some degree of religiosity and/or prejudice in our declared preferences and/or recommendations. The impact of that religiosity can only be minimized [it can't be avoided entirely] by building a table of features and other requirements, and comparing implementations thereby — which will require at least one person here to have deep familiarity with each system. This would itself be a reinvention of many supposedly objective comparison tables out there on the web.)

Or is popularity now to be taken/treated as an inherent negative, as suggested by 'skeptical that "most popular community choice" is a helpful criteria'? (I don't think popularity should identify a winner on its own, but it can be useful when one of many metrics of comparison.)

@dbooth-boston
Copy link
Collaborator Author

Is "a good way to provide an RDF-LAMP stack in general" to be another (vocalized) popularity contest?

Maybe, but I'm also interested in technical rationale. Basically, I was first assuming that a bundled LAMP-like RDF stack would be released as a download that people would install natively, on their Linux, MacOS or Windows environments. But another option might be to bundle it up as a Docker image. I'm wondering whether people think that would be a better idea.

@kasei
Copy link

kasei commented Sep 23, 2022

I’ve always found Docker nice because beyond the relative portability, the associated Dockerfile tells you exactly how the software was installed and configured which is useful even if you’re not going to use the container.

@TallTed
Copy link
Member

TallTed commented Sep 24, 2022

A Docker-based starter kit seems a good direction.

There is some performance hit taken, as with any emulation-space, when running a Linux Docker image on a macOS or Windows host (and a lesser hit, I'd imagine, on a Linux host), but before that hit becomes noteworthy, folks are likely to be ready to migrate to a native installation of the various components.

@jmkeil
Copy link

jmkeil commented Oct 4, 2022

@TallTed wrote:

Or is popularity now to be taken/treated as an inherent negative, as suggested by 'skeptical that "most popular community choice" is a helpful criteria'? (I don't think popularity should identify a winner on its own, but it can be useful when one of many metrics of comparison.)

Of course not. It is just no clear indication. Neither in the one, nor in the other direction. Popular tools might be very stable as they have been tested and improved in thousands of projects, or they might contain high technical dept accumulated over the years. Unpopular tools might contain many problems yet unknown due to little use, or they might have applied from the outset the lessens learned since the release of the popular ones.

@TallTed
Copy link
Member

TallTed commented Oct 4, 2022

@jmkeil — Saying "'most popular community choice' is [not a] clear indication" is very different from saying "'most popular community choice' is [not a] helpful criteria".

The first says that popularity should not be taken to indicate a winner on its own (with which I agree), while the second says that popularity should not be considered at all (with which I do not agree).

Overall age of a project, time since the last significant update, numbers of contributors, numbers of deployers, numbers of end users... All of these are worth consideration as part of the picture; none of them should be considered sufficient on their own.

@dbooth-boston
Copy link
Collaborator Author

Related discussion about tools: https://lists.w3.org/Archives/Public/semantic-web/2024Apr/0009.html

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Category: tools For RDF tools primarily-tools Tool makers should address this website Central RDF website should address this
Projects
None yet
Development

No branches or pull requests

9 participants