-
Notifications
You must be signed in to change notification settings - Fork 178
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alignment with schema.org #285
Comments
Afaik google dataset earch and schema.org stuff is on html pages, not json. Like JSON-LD isn't crawled by search engines. We originally talked about an HTML version of the spec, but a better solution seems to be STAC Browser where we can generate html from the json. So I think we may just need it so STAC Browser html does the mapping to schema.org / dcat / linked data stuff that gets crawled, and then publish that mapping as a recommendation. But a big +1 to aligning - I just think we might have an alternate route to achieving the goal. cc @mojodna |
Can Google crawl dynamically generated JS-based Vue.js pages? @simonff |
If that doesn't work @mojodna had an idea to use another tool to autogenerate the html, either ahead of time or as needed. So worst case people could crawl their own json and write out the html so it can be crawled. |
Google can crawl anything that looks like a human-oriented HTML page, though we might have to make sure the crawl goes through all the pages correctly in case of paged results. I'm talking with the Dataset Search people about setting up the process for aligning schema.org, DCAT and STAC vocabularies. |
There's lots of progress on this on #378 and we are writing up html best practices in #32 Those do not quite 'finish' this - ideally we have something in the spec that talks explicitly about this. But moving this to 'future' for now, and we'll make stac browser the focus of our alignment and then eventually pull its learnings into the spec. |
I have experience with ontologies and linked-data solutions, having worked with the ontology group at Stanford and the Stanford library on several linked-data projects. However, there is a lot of work to be done on GIS ontologies and linked-data for catalog systems. The technology that is most aligned with STAC is obviously JSON-LD, but defining the context for STAC needs some work. I'm generally open to interests and discussions about EOS metadata standards (CF) and GIS metadata as ontologies and linked data. Related projects:
Also, with regard to linking data with publications:
|
Awesome @darrenleeweber! Your help would be great. We have flirted with trying to make STAC JSON into JSON-LD, but stopped short of going all in since then we'd give up geojson compatibility, which is pretty important for working right in geospatial tools. There is a geojson-ld, but it doesn't have much traction yet, unfortunately. It would be awesome if we could get to a JSON-LD version of the core STAC JSON, especially one that is compatible with google's dataset search. STAC Browser does have a mapping to JSON-LD, example:
But it doesn't attempt to capture all the STAC fields, just to map the relevant bits for dataset search. If you want to come to the next sprint this can definitely be a topic. |
I can't make the next sprint due to company commitments on those dates, but I will keep an eye on these developments and help where I can. Consider extending an invite to @gkellogg since he is bay-area native too. Perhaps also @azaroth42 might have some interest in these catalog developments. |
Now that we have the Google Dataset Search and schema.org vocabulary, I think we should evaluate whether we want to somehow align or support schema.org with STAC. I'm not really into schema.org yet, but it seems like a good idea to make STAC datasets even better discoverable by the Google Dataset Search. Opinions? Anyone with more insights into schema.org?
cc @simonff
The text was updated successfully, but these errors were encountered: