New PR with Devin's complete changes #507

pmeredit · 2024-12-11T15:52:52Z

This was generated completely by Devin AI with some prompting. Just checking to see what the fuzz results are

isabelatkinson · 2024-12-11T16:24:46Z

@pmeredit did you want a review on this one?

pmeredit · 2024-12-11T16:45:35Z

@pmeredit did you want a review on this one?

Sorry, not yet! I'm just seeing how these ai generated fuzz tests work. Bson-rust won't evergreen patch for me for some reason.

pmeredit · 2024-12-11T22:12:51Z

.evergreen/config.yml

@@ -154,7 +157,25 @@ functions:
    - command: shell.exec
      params:
        script: |


These evergreen changes are completely AI written except for the echo statement. Pretty impressive

pmeredit · 2024-12-11T22:15:29Z

fuzz/generate_corpus.rs

+    let utf8_cases = doc! {
+        "empty": "",
+        "null_bytes": "hello\0world",
+        "unicode": "🦀💻🔒",


Devin seems to have developed a sense of humor 😂

pmeredit · 2024-12-12T19:04:42Z

I'm not sure why force pushing to my topic branch closed this PR

pmeredit · 2024-12-12T19:04:57Z

reopen

pmeredit · 2024-12-12T19:51:23Z

.evergreen/config.yml

@@ -13,15 +13,18 @@ stepback: true
 command_type: system

 # Protect ourself against rogue test case, or curl gone wild, that runs forever


Yes, devin actually updated this comment

pmeredit · 2024-12-12T20:15:14Z

@isabelatkinson I think it's worth looking at this now. This is almost entirely AI generated.

…but it is created regardless. I could update Devin on this, but I want to move on to other things with Devin.

abr-egn

“Motive," the construct said. "Real motive problem, with an AI. Not human, see?"
"Well, yeah, obviously."
"Nope. I mean, it's not human. And you can't get a handle on it. Me, I'm not human either, but I respond like one. See?"
"Wait a sec," Case said. "Are you sentient, or not?"
"Well, it feels like I am, kid, but I'm really just a bunch of ROM. It's one of them, ah, philosophical questions, I guess...." The ugly laughter sensation rattled down Case's spine. "But I ain't likely to write you no poem, if you follow me. Your AI, it just might. But it ain't no way human.”

― William Gibson, Neuromancer

In general, I'm personally going to have a high value bar that AI-generated PRs will need to clear - they need the same sort of focused line-by-line review that a PR from an entirely unknown contributor does, and it's harder to have a conversation about motivation or tradeoffs.

Was the prompt given here on the general level of "add security-focused fuzz tests" and it picked the cases itself, or did it have guidance on which specific areas to focus on?

abr-egn · 2024-12-16T16:37:53Z

fuzz/Cargo.toml

 [package]
 name = "bson-fuzz"
 version = "0.0.1"
 authors = ["Automatically generated"]
 publish = false
+edition = "2021"


I'd generally prefer to do edition bumps in their own PRs, but it doesn't look like it had any impact here otherwise.

Devin was not pleased that we didn't have an edition 😂

abr-egn · 2024-12-16T16:39:00Z

fuzz/fuzz_targets/malformed_length.rs

+use bson::RawDocument;
+
+fuzz_target!(|buf: &[u8]| {
+    if buf.len() >= 4 {


Why the length guard? RawDocument::from_bytes should handle the smaller cases fine (and if it doesn't we want to know!)

Devin got this from the docs and/or source, I agree it would be better to remove this guard.

abr-egn · 2024-12-16T16:46:48Z

fuzz/fuzz_targets/serialization.rs

+        RawBsonRef::Double(d) => {
+            if d.is_nan() {
+                Some(RawBsonRef::Double(f64::NAN).to_raw_bson())
+            } else if d.is_infinite() {


Nit: this case can be collapsed into the final else.

Presumably Devin would fix this kind of issue if asked. You're definitely right. In this case I think I'll fix it myself.

abr-egn · 2024-12-16T16:50:41Z

fuzz/fuzz_targets/serialization.rs

+                None
+            }
+        }
+        RawBsonRef::ObjectId(id) => Some(RawBsonRef::ObjectId(id).to_raw_bson()),


This (and all the other cases with no extra logic) can just be collapsed down to a single

other => Some(other.to_raw_bson()),

This is interesting. I often don't like catch-alls because they can cause errors when new types are added, but I don't think the bson spec will be adding types any time soon.

abr-egn · 2024-12-16T16:56:01Z

fuzz/fuzz_targets/serialization.rs

+use libfuzzer_sys::fuzz_target;
+use std::str::FromStr;
+
+fn convert_bson_ref(bson_ref: RawBsonRef) -> Option<RawBson> {


I'm not sure I understand the intent of this function (or of the fuzz test in this file, for that matter). This function converts from a borrowed to an owned value (e.g. what to_raw_bson does) and along the way does some sort-of validation that duplicates what the bson library does in deserialization. If the goal is to reimplement the validation as defense in depth, I think having a validate method that doesn't do the ownership conversion would be much simpler; if it's to have a fuzz test that serialized bytes match deserialized bytes, I don't think this is needed at all.

The handling of RawBsonRef::Double is particularly confusing. Why construct a fresh instance with f64::NAN in the d.is_nan() case?

Yes, this is what it did when told to compare the input to the output

abr-egn · 2024-12-16T16:58:54Z

fuzz/fuzz_targets/serialization.rs

+    }
+}
+
+#[derive(Debug, Arbitrary)]


Why is this an Arbitrary struct rather than the fuzzed [u8] slice the other fuzz targets use?

I rewrote this test entirely. Going to have to say Devin failed on this one.

abr-egn · 2024-12-16T17:00:21Z

fuzz/fuzz_targets/string_handling.rs

+                    }
+                    RawBsonRef::Binary(b) if b.subtype == BinarySubtype::Generic => {
+                        // Test UTF-8 validation on binary data
+                        let _ = std::str::from_utf8(b.bytes);


This ... isn't really fuzz testing the bson library at this point, right?

Actually, this is just wrong. Why would a generic binary be utf8?

abr-egn · 2024-12-16T17:08:45Z

src/spec/mod.rs

As a matter of style, we strongly prefer src/foo.rs over src/foo/mod.rs - makes keeping track of editor tabs easier :)

It did this because it added spec/fmt.rs also. I could move that code into spec.rs if you prefer, though?

abr-egn · 2024-12-16T17:10:25Z

src/spec/mod.rs

@@ -23,6 +23,10 @@

 use std::convert::From;

+mod fmt;
+#[allow(unused_imports)]
+pub use self::fmt::*;


Also as a matter of style, we avoid glob imports. Since all that file had is the one impl, I think that can just get inlined into this mod.

I should have read this before my last comment, hah

pmeredit · 2024-12-16T19:29:19Z

“Motive," the construct said. "Real motive problem, with an AI. Not human, see?"
"Well, yeah, obviously."
"Nope. I mean, it's not human. And you can't get a handle on it. Me, I'm not human either, but I respond like one. See?"
"Wait a sec," Case said. "Are you sentient, or not?"
"Well, it feels like I am, kid, but I'm really just a bunch of ROM. It's one of them, ah, philosophical questions, I guess...." The ugly laughter sensation rattled down Case's spine. "But I ain't likely to write you no poem, if you follow me. Your AI, it just might. But it ain't no way human.”

― William Gibson, Neuromancer

In general, I'm personally going to have a high value bar that AI-generated PRs will need to clear - they need the same sort of focused line-by-line review that a PR from an entirely unknown contributor does, and it's harder to have a conversation about motivation or tradeoffs.

Was the prompt given here on the general level of "add security-focused fuzz tests" and it picked the cases itself, or did it have guidance on which specific areas to focus on?

Aha! At first it was "add security-focused fuzz tests", then I talked to it a bit more on the serialize test in particular because its first pass was to simply check that serialize didn't crash, whereas I demand that the output roundtrip with the input.

pmeredit · 2024-12-17T15:26:55Z

@abr-egn I'm also not convinced the malformed_length test is adding any additional benefity, wdyt?

abr-egn

LGTM modulo malformed_length removal.

abr-egn · 2024-12-17T16:29:13Z

@abr-egn I'm also not convinced the malformed_length test is adding any additional benefity, wdyt?

Agreed - the value there is in the corpus generation, there's no need to have a test that just deserializes when that's very solidly covered by all the rest.

pmeredit commented Dec 11, 2024

View reviewed changes

pmeredit closed this Dec 12, 2024

pmeredit force-pushed the topic/devin_bson_fuzzing_complete branch from 17654e8 to f1a08f8 Compare December 12, 2024 18:59

pmeredit added 3 commits December 12, 2024 14:11

Check in Devin's patch

1839b0d

artifacts dir does not mean crashes occur, don't feel like telling Devin

0bb5da2

Remove that extraneous shell script

35fb75a

isabelatkinson reopened this Dec 12, 2024

pmeredit commented Dec 12, 2024

View reviewed changes

pmeredit requested a review from isabelatkinson December 12, 2024 20:14

pmeredit added 3 commits December 13, 2024 11:34

Devin thinks the presence of an artifacts directory implies crashes, …

b44f491

…but it is created regardless. I could update Devin on this, but I want to move on to other things with Devin.

Devin moved the spec file

0d24c9f

Merge branch 'main' into topic/devin_bson_fuzzing_complete

ec09601

abr-egn self-requested a review December 16, 2024 15:47

abr-egn reviewed Dec 16, 2024

View reviewed changes

pmeredit added 4 commits December 16, 2024 16:23

Cleanup some of Devin's silliness

aabac7c

Fix the strange serialization test

f490e93

Use assert

ecfec8a

Still need to worry about double nan

100f70e

pmeredit requested a review from abr-egn December 17, 2024 15:27

abr-egn approved these changes Dec 17, 2024

View reviewed changes

abr-egn removed the request for review from isabelatkinson December 17, 2024 20:17

Remove redundant test

81d4a09

pmeredit merged commit 896a5e1 into mongodb:main Dec 17, 2024
1 check was pending

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New PR with Devin's complete changes #507

New PR with Devin's complete changes #507

pmeredit commented Dec 11, 2024

isabelatkinson commented Dec 11, 2024

pmeredit commented Dec 11, 2024

pmeredit Dec 11, 2024

pmeredit Dec 11, 2024

pmeredit commented Dec 12, 2024

pmeredit commented Dec 12, 2024

pmeredit Dec 12, 2024

pmeredit commented Dec 12, 2024

abr-egn left a comment

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

abr-egn Dec 16, 2024

pmeredit Dec 16, 2024

pmeredit commented Dec 16, 2024

pmeredit commented Dec 17, 2024

abr-egn left a comment

abr-egn commented Dec 17, 2024

		@@ -13,15 +13,18 @@ stepback: true
		command_type: system

		# Protect ourself against rogue test case, or curl gone wild, that runs forever

New PR with Devin's complete changes #507

New PR with Devin's complete changes #507

Conversation

pmeredit commented Dec 11, 2024

isabelatkinson commented Dec 11, 2024

pmeredit commented Dec 11, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pmeredit commented Dec 12, 2024

pmeredit commented Dec 12, 2024

Choose a reason for hiding this comment

pmeredit commented Dec 12, 2024

abr-egn left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pmeredit commented Dec 16, 2024

pmeredit commented Dec 17, 2024

abr-egn left a comment

Choose a reason for hiding this comment

abr-egn commented Dec 17, 2024