Replace calls to panic! with Result types #98

jmacdonald · 2017-08-31T02:32:00Z

After doing a quick search, there are 8 occurrences of panic! in syntect. It'd be great to have these errors bubbled up using Result types with custom errors, instead. With the existing implementation, it's awkward to handle these, and because these methods don't signal the possibility of these errors, you won't realize there's an issue until you hit a runtime panic and have to work backwards to find the source.

The text was updated successfully, but these errors were encountered:

trishume · 2017-08-31T02:41:31Z

Agreed, it'll be unfortunate that parsing will sometimes fail with regex compilation errors, but better than probably better than panicking. I'm unlikely to work on this myself any time soon though.

jmacdonald · 2017-08-31T02:44:25Z

Totally fine. Custom error-chain error setup is still fresh in my mind, so I may tackle both this and #94 shortly.

robinst · 2018-05-07T00:55:48Z

From discussion on #156 (comment):

For checking that regexes compile, we might want to do that when we load the syntaxes from the yaml files (before linking them). For patterns with placeholders, we'd just use a dummy value (a or whatever) before compiling them.

This should not slow down normal use cases where loading from yaml happens once, is serialized to a binary dump, and then later that dump is loaded before matching. Loading the dump would not compile the regexes until they are needed, just like the current behavior.

That gives us a much earlier (and more useful) check if there's a problematic regex in a syntax definition. Note that compiling could still fail at runtime if a dump from a previous version of syntect is loaded, and something about the regex engine changed in syntect, but that should be minimal compared to what we have now. And we could still make matching work with Results later.

When Syntect fails to highlight a file it often does so by panicking. Most often these issues come from us making use of syntax definitions which simply aren't supported by Syntect very well (e.g. they use some new or obscure ST3 feature). There is an upstream issue to [return a `Result` type instead of panicking](trishume/syntect#98) when this occurs. Prior to this change, a user requesting syntect_server to highlight a bad file (i.e. hitting a case in the syntax definition not supported by Syntect) would result in `syntect_server` dying. This has been a known issue for a while, but in practice hasn't been that bad because these cases are relatively rare and Kubernetes / Docker restarts the process very quickly anyway. However, when it does occur it terminates all ongoing highlighting requests which causes blips that users see. After this change, we handle these panics by catching and unwinding the stack. This isn't perfect / ideal / idiomatic Rust code (see the `catch_unwind` docs), but it does solve the problem and is a better approach than e.g. adding more replicas of this service. Fixes sourcegraph/sourcegraph#3164

…20) When Syntect fails to highlight a file it often does so by panicking. Most often these issues come from us making use of syntax definitions which simply aren't supported by Syntect very well (e.g. they use some new or obscure ST3 feature). There is an upstream issue to [return a `Result` type instead of panicking](trishume/syntect#98) when this occurs. Prior to this change, a user requesting syntect_server to highlight a bad file (i.e. hitting a case in the syntax definition not supported by Syntect) would result in `syntect_server` dying. This has been a known issue for a while, but in practice hasn't been that bad because these cases are relatively rare and Kubernetes / Docker restarts the process very quickly anyway. However, when it does occur it terminates all ongoing highlighting requests which causes blips that users see. After this change, we handle these panics by catching and unwinding the stack. This isn't perfect / ideal / idiomatic Rust code (see the `catch_unwind` docs), but it does solve the problem and is a better approach than e.g. adding more replicas of this service. Fixes sourcegraph/sourcegraph#3164

This change makes syntect_server resilient to the two classes of problems we've seen in production usage of it: 1. Some specific language grammar/file pairs can cause syntect to panic internally. This is usually because syntect doesn't implement a specific sublime-syntax feature in some way and it [panics instead of returning result types](trishume/syntect#98). 2. Much rarer, some specific language/grammar file pairs can cause syntect to get stuck in an infinite loop internally -- never to return and consuming an entire CPU core until it is restarted manually. Previously we tried to solve #1 through stack unwinding (c5773da), but since the 2nd issue above also appeared it proved to not be sufficient on its own. It is still useful, though, because it can do per-request recovery of the first failure scenario above and as such it will be added back in. Even without stack unwinding, http-server-stabilizer helps both cases above by running and monitoring replicas of syntect_server. See the README in https://github.com/slimsag/http-server-stabilizer for details. It is important to note that all this does is stop these individual file failures from harming other requests to syntect_server. They are still issues on their own, and logging and Prometheus monitoring is now in place for us to identify when this is occurring and in which file it occurred so we can track down the issue and make small reproduction cases to file and fix upstream. Since only one instance of syntect_server was previously running and we now run multiple, more memory is needed. Each instance requires about 1.1 GB at peak (depending on which languages are used). The default is now to run 4 workers, so 4.4 GB is the minimum required and 6 GB is suggested. In the event only one worker is ran (via setting the env var `WORKERS=1`), stability is still greatly improved since the 2nd failure case above can only last a short period of time instead of until the container is restarted manually. Part of sourcegraph/sourcegraph#5406

* use http-server-stabilizer + 4 worker subprocesses This change makes syntect_server resilient to the two classes of problems we've seen in production usage of it: 1. Some specific language grammar/file pairs can cause syntect to panic internally. This is usually because syntect doesn't implement a specific sublime-syntax feature in some way and it [panics instead of returning result types](trishume/syntect#98). 2. Much rarer, some specific language/grammar file pairs can cause syntect to get stuck in an infinite loop internally -- never to return and consuming an entire CPU core until it is restarted manually. Previously we tried to solve #1 through stack unwinding (c5773da), but since the 2nd issue above also appeared it proved to not be sufficient on its own. It is still useful, though, because it can do per-request recovery of the first failure scenario above and as such it will be added back in. Even without stack unwinding, http-server-stabilizer helps both cases above by running and monitoring replicas of syntect_server. See the README in https://github.com/slimsag/http-server-stabilizer for details. It is important to note that all this does is stop these individual file failures from harming other requests to syntect_server. They are still issues on their own, and logging and Prometheus monitoring is now in place for us to identify when this is occurring and in which file it occurred so we can track down the issue and make small reproduction cases to file and fix upstream. Since only one instance of syntect_server was previously running and we now run multiple, more memory is needed. Each instance requires about 1.1 GB at peak (depending on which languages are used). The default is now to run 4 workers, so 4.4 GB is the minimum required and 6 GB is suggested. In the event only one worker is ran (via setting the env var `WORKERS=1`), stability is still greatly improved since the 2nd failure case above can only last a short period of time instead of until the container is restarted manually. Part of sourcegraph/sourcegraph#5406 * Add Dart+Kotlin support; upgrade to Rocket v0.4 * Continue serving requests when highlighting an individual file fails (#20)

Enselic · 2022-01-08T06:59:20Z

I plan on looking into this for 5.0. This clippy command can be used to find potential panics:

cargo clippy -- --deny clippy::panic --deny clippy::unwrap_used --deny clippy::expect_used

there are currently 38 of these (of which 5 are direct calls to panic!()). Some of them are safe to keep of course.

Canop · 2022-01-21T21:46:09Z

Related: I didn't want to use the version 5 now and I had to fix some panics because it crashed broot, so I converted them to errors in this commit. This commit isn't directly usable as I had to start from an old commit but I don't think it would be hard to do the same on the whole up to date codebase.

In order to fast release a new version of broot, I'm currently using a fork ( https://github.com/Canop/syntect ). Would there be interest in having a no-panic-4.6 branch hosted directly here to help waiting for the new 5.0 version ? Or would that just make the syntect project too messy ?

Enselic · 2022-02-20T10:00:20Z

@Canop Thanks for sharing! I will make sure to cover all cases of Result that you add in that commit before I close this issue.

I will not use your API verbatim however. For example, I'm not a fan of the CrashError type, because:

It is (by my estimation) not idiomatic Rust error handling
I think it is better to be specific. For example, I plan on introducing an ContextMissing error for cases like panic on a svelte file #421.

If you want to embark on this journey together with me, I would appreciate your review on #424 which is one step towards resolving this issue. It sets the structure for future work on this issue by introducing a new enum syntect::Error.

Enselic · 2022-03-27T10:22:30Z

I created a PR (#432) that removes all direct panic!() calls. I tried to carefully make all API changes you did in your commit @Canop. Please read the PR description for details. One thing to note in particular is that there seems to be a slight performance penalty (1-3%-ish) in propagating errors, but personally I think that is OK. I am curious to hear what others in this issue think.

In the PR, all direct panics are gone, and regressions are prevented with a CI check. However, I have not taken care of replacing all .unwrap()s and .expect()s with error propagation, so panics can still happen. And to be honest my mental momentum is starting to run out, so I don't think I will have the energy to take care of that. We can leave this issue open until someone else gets around to do that I suppose.

Enselic · 2022-06-18T17:17:41Z

I think we can consider this fixed by the 5.0.0 release. I don't know of any way to make syntect panic. There are probably such cases, but please open new specific issues with steps to reproduce and we can take care of that on a case by case basis.

This was referenced Apr 27, 2018

Do not fail on regex errors sharkdp/bat#18

Closed

Panic on some input sharkdp/bat#28

Closed

Failing to parse some sublime-syntax files #156

Closed

This was referenced May 25, 2018

Panic due to erroneous regex-rewriting #164

Closed

Check regular expressions for compile errors #165

Merged

slimsag mentioned this issue Apr 19, 2019

Continue serving requests when highlighting an individual file fails sourcegraph/syntect_server#20

Merged

slimsag mentioned this issue Jun 9, 2021

why our syntax highlighter is in bad shape / has a lot of "tech debt", and what we should do about it sourcegraph/sourcegraph-public-snapshot#21942

Closed

Enselic added the enhancement label Nov 10, 2021

Enselic mentioned this issue Nov 25, 2021

Do not compress the outer parts of a SyntaxSet #398

Merged

Enselic mentioned this issue Jan 6, 2022

Prepare for 5.0.0: Update CHANGELOG.md and version #409

Merged

keith-hall mentioned this issue Jan 21, 2022

panic on a svelte file #421

Closed

Enselic mentioned this issue Feb 20, 2022

Make parts of html API return Result with new syntect::Error #424

Merged

Enselic mentioned this issue Mar 1, 2022

Make highlight_line() and parse_line() return Result #426

Merged

This was referenced Mar 12, 2022

4.7 is not uploaded to crates.io #428

Closed

yaml_load.rs: Propagate FromUtf8Error errors from regexes #430

Closed

Enselic mentioned this issue Mar 27, 2022

Replace direct panics with Errors #432

Merged

Enselic closed this as completed Jun 18, 2022

tobiasvl mentioned this issue Jul 24, 2022

Document errors #440

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace calls to panic! with Result types #98

Replace calls to panic! with Result types #98

jmacdonald commented Aug 31, 2017

trishume commented Aug 31, 2017

jmacdonald commented Aug 31, 2017

robinst commented May 7, 2018

Enselic commented Jan 8, 2022 •

edited

Loading

Canop commented Jan 21, 2022 •

edited

Loading

Enselic commented Feb 20, 2022

Enselic commented Mar 27, 2022

Enselic commented Jun 18, 2022 •

edited

Loading

Replace calls to panic! with Result types #98

Replace calls to panic! with Result types #98

Comments

jmacdonald commented Aug 31, 2017

trishume commented Aug 31, 2017

jmacdonald commented Aug 31, 2017

robinst commented May 7, 2018

Enselic commented Jan 8, 2022 • edited Loading

Canop commented Jan 21, 2022 • edited Loading

Enselic commented Feb 20, 2022

Enselic commented Mar 27, 2022

Enselic commented Jun 18, 2022 • edited Loading

Enselic commented Jan 8, 2022 •

edited

Loading

Canop commented Jan 21, 2022 •

edited

Loading

Enselic commented Jun 18, 2022 •

edited

Loading