Skip to content

Commit

Permalink
Add config file for search scraper
Browse files Browse the repository at this point in the history
Config for `docs-scraper`, which populates the search index.

QA Steps: n/a

No related task.

Signed-off-by: Katrina Prosise <katrina.prosise@foundries.io>
  • Loading branch information
kprosise committed Aug 12, 2024
1 parent f1b2669 commit 7b447eb
Showing 1 changed file with 37 additions and 0 deletions.
37 changes: 37 additions & 0 deletions scraper.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
{
"index_uid": "UID",
"start_urls": [
{
"url": "https://docs.foundries.io/latest/"
}
],
"stop_urls": [],
"selectors": {
"lvl0": {
"selector": "title",
"global": true,
"default_value": "Documentation"
},
"lvl1": "h1",
"lvl2": "h2",
"lvl3": "h3",
"lvl4": "h4",
"lvl5": "h5",
"lvl6": "h6",
"text": "p"
},
"min_indexed_level": 2,
"scrap_start_urls": true,
"strip_chars": " .,;:#",
"custom_settings": {
"synonyms": {
"Linux microPlatform": [
"LmP"
],
"LmP": [
"Linux microPlatform"
]
}
}
}

0 comments on commit 7b447eb

Please sign in to comment.