Add API to correctly insert GuidV7 to SQLServer (BigEndian), so no index fragmentation occurs #2999

jogibear9988 · 2024-11-08T07:38:37Z

Describe the bug

With version 7 GUIDs (UUIDv7) added in .NET 9, SQL Server stores them in a way that still fragments the index (see issue: dotnet/runtime#103658).

There should be a new setting in SqlClient to switch GUID serialization to "TryWriteBytes(destination, bigEndian: true)", so that SQL Server can also benefit from the advantages of the new GUID format.
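To make the two layouts concrete, here is a minimal sketch (assuming .NET 8 or later, where the `bigEndian` overloads exist). It prints both serializations of one of the UUIDv7 values that appears later in this thread. The helper name `Hex` is only for illustration.

```csharp
using System;

// Illustrative helper (not part of any library): serialize a Guid and hex-encode it.
static string Hex(Guid g, bool bigEndian)
{
    Span<byte> buffer = stackalloc byte[16];
    g.TryWriteBytes(buffer, bigEndian, out _);
    return Convert.ToHexString(buffer);
}

Guid v7 = Guid.Parse("019311df-a2d8-7adf-976a-9ab9b0b0461b");

// Default (little-endian) layout: the first three fields are byte-swapped.
Console.WriteLine(Hex(v7, bigEndian: false)); // DF119301D8A2DF7A976A9AB9B0B0461B

// Big-endian layout: bytes follow the string form, so the 48-bit timestamp leads.
Console.WriteLine(Hex(v7, bigEndian: true));  // 019311DFA2D87ADF976A9AB9B0B0461B
```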

jogibear9988 · 2024-11-08T07:41:37Z · Author

see also: dotnet/runtime#103658 (comment)
with a comment from @tannergooding

    This would then be something to raise with the SQL Server team, which is fully external to the .NET Libraries team and has their own management, customers (including beyond .NET), developers, back-compat bars, API review process, and other considerations.

jogibear9988 · 2024-11-08T07:45:38Z · Author

https://feedback.azure.com/d365community/idea/69d13f8a-b9fb-ed11-a81c-000d3adb7ffd
https://feedback.azure.com/d365community/idea/cb67ad60-2433-ef11-a296-6045bdbc5330

MichelZ · 2024-11-08T07:46:33Z

#2946 is a prereq for this

edwardneal · 2024-11-08T07:46:41Z

It's also worth referencing dotnet/efcore#33579, which covers a similar problem within EF Core.

jogibear9988 · 2024-11-08T07:51:21Z · Author

    It's also worth referencing dotnet/efcore#33579, which covers a similar problem within EF Core.

I thought that adding a parameter to the connection string, like "SerializeGuidsBigEndian:true", could work, so that EF Core or any other ORM could also benefit.

ErikEJ · 2024-11-08T08:11:33Z

@jogibear9988 Should it not be possible to simply read the new Version property? If so, a much better UX I think.

jogibear9988 · 2024-11-08T08:58:53Z · Author

    @jogibear9988 Should it not be possible to simply read the new Version property? If so, a much better UX I think.

??? Which version?
You could also have created a v7 GUID before .NET 9 and already stored it in the database. If you now change the byte order, you may break old programs. I would create a setting and then read and write all GUIDs as big-endian.

Wraith2 · 2024-11-08T11:29:28Z

If the setting to change how guids are handled is in the connection string then you can't migrate a database from using one type to another without losing the ability to use guids that already exist and that doesn't seem like an acceptable change to me.

Can you provide some simple tests for 1) "normal" guids and 2) v7 guids that show how they should be written and read back? I suspect that we need to have dedicated read and write methods because the type does not contain information about whether the bytes should be interpreted in a specific way or not. Consider how you would want to have both old and new guids used as parameter values. How do you specify that the contents are one or the other?

If we need to extend only Sql* types like SqlDataReader with a method like GetGuidV7 then that's ok but if we need to push that up to DbDataReader then that requires a framework change and I don't know what the opinion of other providers (postgres, mysql, sqlite) would be about that change. /cc @roji

jogibear9988 · 2024-11-08T15:48:06Z · Author

On second thought, my suggestion will not work. It would technically work, but the GUIDs would then have a different value when written with .NET than when you read them without the connection parameter. I think the only correct solution would be a new GUID type in SQL Server that serializes the same way all other databases do.

edwardneal · 2024-11-08T18:13:50Z

Now that I've got a bit of time to respond properly!

I think you're right; a blanket change to the way GUIDs are written to the network wouldn't work. It'd mean that every primary key value would be interpreted differently, and potentially that a developer could run a query from SQL Server Management Studio/Azure Data Studio and see a GUID with one value, but run the same query via M.D.S and see another value.

The real problem isn't the way that a TDS library transports GUIDs to the server, it's the way that a client generates GUIDs in the first place. The EF Core issue I linked is designed to address that for one library.

A better approach may be to create a utility method similar to SqlGuid.NewGuid(), which wraps Guid.NewGuid() and makes the result sortable by SQL Server. I'm not sure where this'd need to go to though. If that's a practical addition to something in the SqlTypes namespace, it'd be an API request in dotnet/runtime. If the SqlTypes namespace isn't accepting changes, maybe we could have a SqlGuidExtensions utility class in our SqlTypes namespace.

Wraith2 · 2024-11-08T18:55:09Z

    The real problem isn't the way that a TDS library transports GUIDs to the server, it's the way that a client generates GUIDs in the first place.

Guids are just data. SqlClient doesn't generate them at all. We're currently in a situation where anything that is identified as a guid gets handled in a specific way and that works. What we have to add is some way to have a second handler without having a second type and that means the type information must be maintained out of band by the user.

The SqlParameter input case that was mentioned is indicative of this. If you just put a guid into an object typed variable then there isn't enough information. The same problem happens in the other direction. If you have some rowdata which is typed as uniqueidentifier by sql server there is no way to know if that's old or new format. So whatever happens I don't see any other way to proceed than having special methods to both read and write the new format. We'll also need a wrapper type for object typed locations like SqlParameter.

vanbukin · 2024-11-08T18:55:58Z

Depending on what type of data is used in the database column for storage, indexes may or may not be fragmented (uniqueidentifier / binary(16))
dotnet/runtime#103658 (comment)

edwardneal · 2024-11-08T19:25:50Z

        The real problem isn't the way that a TDS library transports GUIDs to the server, it's the way that a client generates GUIDs in the first place.

    Guids are just data. SqlClient doesn't generate them at all.

I'm talking about a client-side operation, not M.D.S. Sorry for the confusion here.

    We're currently in a situation where anything that is identified as a guid gets handled in a specific way and that works. What we have to add is some way to have a second handler without having a second type and that means the type information must be maintained out of band by the user.

    The SqlParameter input case that was mentioned is indicative of this. If you just put a guid into an object typed variable then there isn't enough information. The same problem happens in the other direction. If you have some rowdata which is typed as uniqueidentifier by sql server there is no way to know if that's old or new format. So whatever happens I don't see any other way to proceed than having special methods to both read and write the new format. We'll also need a wrapper type for object typed locations like SqlParameter.

The methods you're describing make complete sense, and if that's definitely the problem to solve then I agree - it's the right way to solve it. I just don't think that SqlClient's conversion of Guids is the core problem. As I read it, the core problem seems to be that if a .NET application generates and inserts many GUIDs in parallel into a table with an index covering a uniqueidentifier column, that index will be fragmented. We'd be able to patch it within SqlClient with the method you've described, but I think the core problem is that the GUIDs being generated aren't currently "SQL Server-friendly." I've suggested implementing SqlGuid.NewId() to try to fix the problem at the point of generation.

tannergooding · 2024-11-08T19:32:58Z

SQL Server is using ToByteArray (or TryWriteBytes) on the Guid when storing the data. Since the bool bigEndian parameter is new and they aren't passing it in, the data is serialized in little-endian format.

However, when it later loads it as SqlGuid to do a comparison, it ends up doing so in byte order (effectively comparing assuming it's in big-endian format): https://source.dot.net/#System.Data.Common/System/Data/SQLTypes/SQLGuid.cs,8332f43230838509,references -- Edit: This might be the wrong code link, I seem to have misplaced where the actual logic that shows the problem was

What this means is that the issue isn't in how the data is serialized, nor is the issue in how the data is compared. The root cause of the problem here is that serialization and comparison do not agree with each other on the data format. If you changed the serializer to write as big endian that would fix the issue (as now it would match what the comparer expects). Identically, if you instead changed the comparer to compare using little endian that would also fix the issue (as now it would match what the serialization is storing).

This behavior cannot be "fixed" for back-compat reasons, but some new APIs could be exposed that allow users to get the right thing to happen. Similarly, users can rely on the current behavior and work around it by swapping the endianness of their produced Guid prior to giving it to the SQL client (which will have the net effect of it being serialized as big endian).
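The disagreement between the two orderings can be observed from .NET itself. The sketch below compares the same pair of values once as `Guid` and once as `System.Data.SqlTypes.SqlGuid`, whose comparison follows SQL Server's uniqueidentifier ordering. The two values are synthetic, UUIDv7-shaped examples, not real data. As a later reply in this thread notes, the SqlGuid order is not a plain big- or little-endian order: it starts with the last six bytes.

```csharp
using System;
using System.Data.SqlTypes;

// Synthetic UUIDv7-shaped values: 'earlier' has the smaller timestamp
// but the larger trailing bytes.
Guid earlier = Guid.Parse("01932f6a-0000-7000-8000-0000000000ff");
Guid later = Guid.Parse("01932f6b-0000-7000-8000-000000000001");

// Guid compares field by field from the front, so the timestamp decides.
int guidOrder = earlier.CompareTo(later);

// SqlGuid starts with the last six bytes, so the trailing bytes decide.
int sqlOrder = new SqlGuid(earlier).CompareTo(new SqlGuid(later));

Console.WriteLine($"Guid: {guidOrder}, SqlGuid: {sqlOrder}");
```

The signs differ: `Guid` sorts `earlier` first, while `SqlGuid` sorts it last, which is why time-ordered values do not land at the end of the index.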

jogibear9988 · 2024-11-08T20:46:41Z · Author

But a new column type like Guid2 could fix it, or am I wrong?

Or a new database flag that could change the default GUID comparison?

edwardneal · 2024-11-11T20:42:48Z

Thanks for the explanation tannergooding. I agree that the right way to sort UUIDs is as 128-bit uints, and I think it brings us full circle to the core problem - SQL Server sorts them differently. The root cause of the fragmentation is that v7 UUIDs are only effectively sorted in order of unix_ts_ms (and only reduce index fragmentation) when the database server sorts UUIDs in the way we expect, and we don't have any way to force SQL Server to use that algorithm. IMO, the remaining topic is purely a question of where we adapt to that:

When transporting the UUIDv7, sending the bytes in an order such that the timestamp bytes in the UUIDv7 GUID are transparently rearranged to fit the SQL Server sort order;
At the point of generating it, adding a method which generates a "SQL Server sort compatible" UUIDv7 GUID up-front.

We could change the way we read and write a GUID on the transport. There are a few different options for that - if we wanted to be explicit, we could use a Uuidv7 SqlDbType, a SqlDataReader.GetUuidv7 method, etc. If we wanted to make it "just work" then we could detect a GUID with version 7 and handle it differently. Either of these approaches would let us send and receive UUIDv7 GUIDs.
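As a rough sketch of what "detect a GUID with version 7" could look like, the helper below reads the 4-bit version field directly from the big-endian layout (assuming .NET 8 or later for the `bigEndian` overload). The name `GetUuidVersion` is hypothetical. Detection only tells you what the value claims to be, not how an already-stored value was written.

```csharp
using System;

// Illustrative helper: the version is the high nibble of byte 6
// in the big-endian (RFC) layout.
static int GetUuidVersion(Guid g)
{
    Span<byte> bytes = stackalloc byte[16];
    g.TryWriteBytes(bytes, bigEndian: true, out _);
    return bytes[6] >> 4;
}

Console.WriteLine(GetUuidVersion(Guid.Parse("019311df-a2d8-7adf-976a-9ab9b0b0461b"))); // 7
Console.WriteLine(GetUuidVersion(Guid.Parse("4e6c67ef-28bd-4f98-9fb1-3f2309f984b0"))); // 4
```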

One risk which I can see with that approach is how clients would switch from using normal Guids to UUIDv7 GUIDs when other people have a record of the ID to look up, such as the below:

Guid id = Guid.Parse(userInput);
SqlCommand cmd = CreateCommandWithUuidv7Parameter(sql: "SELECT TOP 1 Name FROM MyTable WHERE Id = @Id", id);

using SqlDataReader reader = cmd.ExecuteReader();

Assert.IsTrue(reader.Read());

In the simple case (table contains only reshuffled UUIDv7 uniqueidentifiers) this'll be fine. In a real-world case, where the table contains a mixture of UUIDv7 and non-UUIDv7 uniqueidentifiers the results will have changed. While there are workarounds, it feels to me like we'd be opening a pit of failure.

Placing the responsibility on the transport layer also opens a slightly weird quirk: a test similar to the below would fail, because the Guid would have its bytes rearranged on the transport but the varchar would not.

Guid id = Guid.CreateVersion7();
SqlCommand cmd = CreateCommandWithUuidv7Parameter(sql: "SELECT @Id, CAST(@Id AS varchar(max))", id);
using SqlDataReader reader = cmd.ExecuteReader();

reader.Read();

Assert.AreEqual(id, reader.GetUuidv7(0)); // True
Assert.AreEqual(id.ToString(), reader.GetString(1)); // False

My opinion's thus slightly in favour of creating a utility method somewhere - whether we create a SqlGuidHelper.NewSequentialId method in SqlClient, or request a SqlGuid.NewSequentialId method in runtime - which generates a SqlGuid which SQL Server would consider sequential, based on transforming Guid.CreateVersion7 as vanbukin suggested. This leaves the transport of GUIDs untouched, and means that switching from using Guid.NewGuid to the new method becomes trivial.

jogibear9988 · 2024-11-11T21:37:02Z · Author

    My opinion's thus slightly in favour of creating a utility method somewhere - whether we create a SqlGuidHelper.NewSequentialId method in SqlClient, or request a SqlGuid.NewSequentialId method in runtime - which generates a SqlGuid which SQL Server would consider sequential, based on transforming Guid.CreateVersion7 as vanbukin suggested. This leaves the transport of GUIDs untouched, and means that switching from using Guid.NewGuid to the new method becomes trivial.

On our side we use different databases, and GUIDs could also be generated in the frontend, so this would not help.
We generate v7 GUIDs, which work correctly for other databases but not for SQL Server.

jogibear9988 · 2024-11-11T21:40:07Z · Author

https://devblogs.microsoft.com/oldnewthing/20190426-00/?p=102450

jogibear9988 · 2024-11-11T21:42:50Z · Author

mareek/UUIDNext#2 (comment)

    Don't use SQL Server (just kidding). You should file a bug, though, that UNIQUEIDENTIFIER doesn't work well for your use case and that we need a sortable UUID type in SQL Server.

Wraith2 · 2024-11-11T22:53:39Z

Ok. So we've been around all the arguments back and forth. Are we back to the point where we can discuss the api instead of taking cheap shots at sql server?

Methods can be provided which will accept a little endian guid and convert it to big endian. These would be SqlClient specific. You could use them directly or EFCore could use them on your behalf and provide some sort of attribute [SqlServerBigEndianGuid] for decorating cto properties. GetGuidBE, GetSqlGuidBE? We'll need a wrapper of some sort for SqlParameter, do we just force users to manually create a big endian SqlGuid?

edwardneal · 2024-11-11T23:16:57Z

Thanks @Wraith2. I'll be able to write a concise reply at some point.

I'm (tentatively) in favour of something like SqlGuid.NewSequentialId or GuidExtensions.NewSequentialId to generate a new UUIDv7 value from scratch and reshuffle the bytes into a layout which matches SQL Server's sort algorithm; this aligns with the NEWSEQUENTIALID function in SQL Server. I'd then add a couple of Wrap functions to allow running those transformations on existing UUIDv7 values, and add variations for SqlGuid.

public static class GuidExtensions
{
    // Generates a UUIDv7 and reshuffles its bytes to align with SQL Server's sort algorithm.
    public static Guid NewSequentialId() { }
    public static SqlGuid NewSqlSequentialId() { }

    // Wraps user input, performing the same byte reshuffling. Throws if version7Uuid.Version is not 7.
    public static Guid WrapVersion7(Guid version7Uuid) { }
    public static SqlGuid WrapSqlVersion7(SqlGuid version7Uuid) { }
}

Once that Guid has been created by the helper class, I don't think it actually needs any transformation layers on SqlDataReader or SqlParameter - the return value would just be another Guid, all transformation taken care of. EF Core could then update its SequentialGuidValueGenerator to take advantage of it and close efcore#33579.

Wraith2 · 2024-11-11T23:40:11Z

People can do that already. The problem is that GUIDs that aren't created in that way are still capable of fragmenting the index. You shouldn't need to use the database (or driver) as a source of GUIDs in order to make sure it works. We should be able to take a GUID created by an application or taken from a Postgres database and insert it into the database in a specific way. Pushing it up the stack to creation time leaks implementation details of SQL Server that people shouldn't need to care about.

tannergooding · 2024-11-12T01:55:06Z

    I'm (tentatively) in favour of something like SqlGuid.NewSequentialId or GuidExtensions.NewSequentialId to generate a new UUIDv7 value from scratch and reshuffle the bytes into a layout which matches SQL Server's sort algorithm; this aligns with the NEWSEQUENTIALID function in SQL Server. I'd then add a couple of Wrap functions to allow running those transformations on existing UUIDv7 values, and add variations for SqlGuid.

I think this is also focusing too much on the wrong details. Notably, any new API should do the "right thing" with any Guid value. It doesn't really matter if it's UUIDv7, UUIDv8 (user-defined/custom format), an arbitrary 128-bit GUID that doesn't follow any existing RFC, etc.

UniqueIdentifier values are already stored in a particular way; that doesn't really need to change and is already fine as is. What is really needed is an equivalent to SqlGuid that does the comparison of the bytes in the right order, and a way to tell SQL Client to use it instead.

That is, the needed fix is functionally similar to providing a custom IComparer to a Dictionary, rather than using the built-in default.

jogibear9988 · 2024-11-12T06:19:14Z · Author

My initial idea when I opened the ticket here was that SqlClient could change the bytes when writing to SQL Server. I didn't consider that the GUID would then be wrong when, for example, I use it with another connection library to SQL Server or directly in SQL.

So I now think the only solution is a fix directly in SQL Server, for example a database flag to change the default GUID handling so that it is stored/compared in another way.

edwardneal · 2024-11-12T08:29:48Z

I'm slightly in favour of leaving the transport layer alone because of a scenario like the below.

Guid id = Guid.Parse(userInput);

using var connection = new SqlConnection(connStr);

connection.Open();

using var command = new SqlCommand("SELECT COUNT(1) FROM MyTable WHERE Id = @Id", connection);

command.Parameters.Add("@Id", SqlDbType.UniqueIdentifier).Value = id;

var rowCount = (int)command.ExecuteScalar();

Console.WriteLine($"Row count: {rowCount}");

The contents of MyTable might look as such:

Id
4e6c67ef-28bd-4f98-9fb1-3f2309f984b0
00acd809-4313-46fa-a774-a97cc67a3d62
4c5713e9-3a67-4f1c-ad3e-f1c5ff6b1c7d
019311df-a2d8-7adf-976a-9ab9b0b0461b
1f688481-9f18-4ba2-a33a-0ae45c0bb5bc

All told, it's a pretty trivial example: we accept user input, then print the number of records with a matching ID. Most of the IDs here are not V7 UUIDs, so its index fragmentation will eventually grow.

If we now implement some kind of byte-shuffling within SqlClient, our sample code behaviour has changed.

Byte-shuffle all GUIDs

Now, if the user inputs a GUID of 4e6c67ef-28bd-4f98-9fb1-3f2309f984b0 the row count will be zero. SqlClient returns incorrect results.

Byte-shuffle all GUIDs if the parameter has a new SqlDbType

If the user inputs a GUID of 4e6c67ef-28bd-4f98-9fb1-3f2309f984b0 the row count will be one. This result is correct.
If the user inputs a GUID of 019311df-a2d8-7adf-976a-9ab9b0b0461b the row count will be one. This result is correct.

This is an improvement. However, now insert a new UUIDv7 value into MyTable. This UUIDv7 value has been generated via Guid.CreateVersion7() and it has been passed as a parameter with the new SqlDbType type (SqlDbType.Uuidv7?) into a simple INSERT INTO SQL statement. The output of CreateVersion7 has been returned to the user, but the byte-shuffle has inserted a different value into the table.

If the user provides the value they've been given to the code snippet, the row count will be zero. SqlClient returns incorrect results.

Same as above, with changes to the code snippet

Perhaps we try to fix this by using a code snippet as so:

Guid id = Guid.Parse(userInput);

using var connection = new SqlConnection(connStr);

connection.Open();

using var command = new SqlCommand("SELECT COUNT(1) FROM MyTable WHERE Id = @Id", connection);

// This has changed: the SqlDbType is now the same as the SqlDbType passed to the INSERT statement.
command.Parameters.Add("@Id", SqlDbType.Uuidv7).Value = id;

var rowCount = (int)command.ExecuteScalar();

Console.WriteLine($"Row count: {rowCount}");

The user provides the UUIDv7 value given in the previous scenario to the code snippet, the row count is one. Bug appears to be fixed.
The user re-tests with a pre-existing GUID of 4e6c67ef-28bd-4f98-9fb1-3f2309f984b0, and the row count will be zero. SqlClient returns incorrect results - it has the same problem as the previous section, but impacting all non-UUIDv7 values.

I don't see these situations as corner cases: it's the result of using an existing database which has a combination of Uuidv7 and non-Uuidv7 GUIDs in a single table.
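The scenarios above can be simulated without a database. In this sketch a `HashSet<Guid>` stands in for MyTable and `Shuffle` stands in for whatever byte rearrangement the transport would apply. Both are assumptions for illustration only (here `Shuffle` is a plain endianness swap, available on .NET 8 or later), and the newly issued value is synthetic.

```csharp
using System;
using System.Collections.Generic;

// Stand-in for the transport's rearrangement; any fixed byte permutation
// that changes the value behaves the same way below.
static Guid Shuffle(Guid g) => new Guid(g.ToByteArray(bigEndian: true));

// Pre-existing rows, written without any shuffling.
Guid existing = Guid.Parse("4e6c67ef-28bd-4f98-9fb1-3f2309f984b0");
var table = new HashSet<Guid>
{
    existing,
    Guid.Parse("019311df-a2d8-7adf-976a-9ab9b0b0461b"),
};

// A new row is inserted through the shuffling parameter type;
// the unshuffled value is what the user is given.
Guid issued = Guid.Parse("01932f6a-0000-7000-8000-000000000001");
table.Add(Shuffle(issued));

bool plainFindsNew = table.Contains(issued);              // looked up as UniqueIdentifier
bool shuffledFindsNew = table.Contains(Shuffle(issued));  // looked up as the new type
bool shuffledFindsOld = table.Contains(Shuffle(existing));
bool plainFindsOld = table.Contains(existing);

Console.WriteLine($"{plainFindsNew} {shuffledFindsNew} {shuffledFindsOld} {plainFindsOld}");
// False True False True
```

Neither lookup style finds both the old and the new row, which is the mixed-table problem described above.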

Working scenario

For this to work, we'd need to change the snippet as so:

Guid id = Guid.Parse(userInput);

using var connection = new SqlConnection(connStr);

connection.Open();

using var command = new SqlCommand("SELECT COUNT(1) FROM MyTable WHERE Id IN (@Id_1, @Id_2)", connection);

// Note that there are now two parameters: one for the old format, one for the new format.
command.Parameters.Add("@Id_1", SqlDbType.UniqueIdentifier).Value = id;
command.Parameters.Add("@Id_2", SqlDbType.Uuidv7).Value = id;

var rowCount = (int)command.ExecuteScalar();

Console.WriteLine($"Row count: {rowCount}");

This works, but it's not obvious - and I'd personally say that it's overkill when all the user wants is to insert a GUID which doesn't fragment the index covering its column. Separately, while this is unlikely, we no longer have the guarantee of "at-most-one-record" we'd get from a direct equality check to a primary key column.

Other variations

We might want to turn this problem into an edge case by deciding that we'll only byte-shuffle UUIDv7 Guids with a new SqlDbType. This solves the problem for people who only use the runtime, but it still means that it'll affect somebody who already generates their own UUIDv7 Guids.

Another way to handle this might be a separate Uuid type, and only byte-shuffle these. I've got no technical objection to this, although it feels a little inconsistent for the runtime to store a Uuidv7 in a standard Guid but for SqlClient to have a dedicated type.

If I'm missing something obvious and this type of case isn't a problem then I've not got any technical objection to writing Guids differently in the transport layer.

tannergooding · 2024-11-12T08:56:15Z

The storage is correct and the loading of the guid into SqlGuid is correct, this all round trips today to any GUID the user authors.

It’s really just an issue with the comparer function internally serializing to a stack local area of memory and then comparing bytes in the wrong order

7 replies

jogibear9988 Nov 13, 2024
Author

    The storage is correct and the loading of the guid into SqlGuid is correct, this all round trips today to any GUID the user authors.

    It’s really just an issue with the comparer function internally serializing to a stack local area of memory and then comparing bytes in the wrong order

Because of that, I now think a flag for changing the comparison in SQL Server would be preferable.

MichelZ Nov 13, 2024

I also think this is a case of: You can do that yourself today with generating a GUID that suits what SQL Server can sort.
The only "real" solution otherwise would be a uniqueidentifier2 column type in SQL Server that fixes the behavior

Wraith2 Nov 13, 2024

Changing SqlServer is out of scope for this repository.
Adding some sort of fake uniqueidentifier2 in SqlClient and attempting to detect when to use it would be error prone and a mistake.
Changing the way guids are interpreted per connection string prevents mixing BE and LE guids which makes the feature impossible to use on existing databases, this would result in the feature being almost useless.

vanbukin Nov 19, 2024

@Wraith2 Well, in general, the problem under discussion has already been solved in MySqlConnector through connection string parameters, depending on which the behavior of reading and writing Guid changes. It might be worth considering this option. It will not require any changes to existing APIs or the addition of new data types.

Wraith2 Nov 19, 2024

If all they needed was connection string parameters then they don't have the same problem we do. I'd guess there was a metadata difference that allowed the behaviour to be automatic.

cheenamalhotra · 2024-11-13T02:41:59Z · Maintainer

Appreciate the feedback and participation! Please use discussions for topics like these until the author plans to propose something concrete as a feature that is ready for review by the maintainers.

jogibear9988 · 2024-11-13T08:56:34Z · Author

Also, I think we do not need a special GUID creator function; UUIDNext already provides this.

vanbukin · 2024-11-15T08:41:32Z

The entire discussion can be replaced by the function

static Guid ReorderGuidV7ToUniqueIdentifierOrder(Guid guid)
{
    Span<byte> src = stackalloc byte[16];
    Span<byte> dst = stackalloc byte[16];
    guid.TryWriteBytes(src, true, out _);
    dst[0] = src[12];
    dst[1] = src[13];
    dst[2] = src[14];
    dst[3] = src[15];
    dst[4] = src[10];
    dst[5] = src[11];
    dst[6] = src[8];
    dst[7] = src[9];
    dst[8] = src[6];
    dst[9] = src[7];
    dst[10] = src[0];
    dst[11] = src[1];
    dst[12] = src[2];
    dst[13] = src[3];
    dst[14] = src[4];
    dst[15] = src[5];
    return new Guid(dst, false);
}
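Because the function above changes the value that ends up in the column, anything that reads the column back needs the inverse mapping to recover the original UUIDv7. A minimal sketch of both directions, written as a permutation table (assuming .NET 8 or later). The names `Map`, `ToSqlServerOrder` and `FromSqlServerOrder` are hypothetical, and the table simply restates the assignments above.

```csharp
using System;

// map[i] = index in the big-endian layout whose byte is stored at position i.
static byte[] Map() => new byte[] { 12, 13, 14, 15, 10, 11, 8, 9, 6, 7, 0, 1, 2, 3, 4, 5 };

static Guid ToSqlServerOrder(Guid guid)
{
    byte[] map = Map();
    Span<byte> src = stackalloc byte[16];
    Span<byte> dst = stackalloc byte[16];
    guid.TryWriteBytes(src, bigEndian: true, out _);
    for (int i = 0; i < 16; i++) dst[i] = src[map[i]];
    return new Guid(dst, bigEndian: false);
}

static Guid FromSqlServerOrder(Guid stored)
{
    byte[] map = Map();
    Span<byte> src = stackalloc byte[16];
    Span<byte> dst = stackalloc byte[16];
    stored.TryWriteBytes(src, bigEndian: false, out _);
    for (int i = 0; i < 16; i++) dst[map[i]] = src[i]; // undo the permutation
    return new Guid(dst, bigEndian: true);
}

Guid original = Guid.Parse("019311df-a2d8-7adf-976a-9ab9b0b0461b");
Guid stored = ToSqlServerOrder(original);
Console.WriteLine(stored);                     // value as it would appear in the column
Console.WriteLine(FromSqlServerOrder(stored)); // the original value again
```

Every reader and writer of the column has to agree on this mapping, which is the compatibility concern raised in the replies below.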

3 replies

Wraith2 Nov 15, 2024

Ok. Where are you planning to put that and how is it going to not break my existing data?

tannergooding Nov 15, 2024

👍 that technically works around the issue, but it fundamentally changes the Guid that will be stored/retrieved which can break other downstream things.

It's like if you have an int32 with the value of 2 and store it as little-endian, but then read it as big-endian, then you get back 0x02000000 instead (33_554_432).
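The int32 analogy can be reproduced directly with `System.Buffers.Binary.BinaryPrimitives`; a minimal sketch:

```csharp
using System;
using System.Buffers.Binary;

Span<byte> buffer = stackalloc byte[4];

// Store 2 as little-endian: the bytes are 02 00 00 00.
BinaryPrimitives.WriteInt32LittleEndian(buffer, 2);

// Read the same bytes as big-endian: 0x02000000.
int misread = BinaryPrimitives.ReadInt32BigEndian(buffer);

Console.WriteLine(misread); // 33554432
```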

The issue here isn't in how the Guid is stored or retrieved back out, as in both cases the Guid remains the original value the user provided. The issue is entirely in the comparer doing the wrong thing (and something which doesn't actually match big or little-endian ordered GUID/UUIDs at that)

Notably, if a comparer existed that did the right comparison order, not only would "sortable" GUID values (like UUIDv7) have an increase in performance; but general comparisons would as well, as it could be done in many 4x operations, rather than 16.
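As an illustration of comparing in wider operations, the sketch below orders two GUIDs as unsigned 128-bit big-endian integers using two 64-bit comparisons instead of a byte-by-byte loop (assuming .NET 8 or later). The name `CompareAsBigEndian` is hypothetical, and this is not a claim about how SQL Server or SqlClient would implement such a comparer.

```csharp
using System;
using System.Buffers.Binary;

// Illustrative comparer: high 64 bits first, then low 64 bits.
static int CompareAsBigEndian(Guid x, Guid y)
{
    Span<byte> xb = stackalloc byte[16];
    Span<byte> yb = stackalloc byte[16];
    x.TryWriteBytes(xb, bigEndian: true, out _);
    y.TryWriteBytes(yb, bigEndian: true, out _);

    int high = BinaryPrimitives.ReadUInt64BigEndian(xb)
        .CompareTo(BinaryPrimitives.ReadUInt64BigEndian(yb));
    if (high != 0) return high;

    return BinaryPrimitives.ReadUInt64BigEndian(xb.Slice(8))
        .CompareTo(BinaryPrimitives.ReadUInt64BigEndian(yb.Slice(8)));
}

// Synthetic UUIDv7-shaped values: the smaller timestamp sorts first,
// regardless of the trailing bytes.
Guid earlier = Guid.Parse("01932f6a-0000-7000-8000-0000000000ff");
Guid later = Guid.Parse("01932f6b-0000-7000-8000-000000000001");
Console.WriteLine(CompareAsBigEndian(earlier, later)); // -1
```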

edwardneal Nov 15, 2024

SQL Server's NEWID and NEWSEQUENTIALID functions both return UUIDv1 values - NEWID is documented as being compliant with RFC4122, NEWSEQUENTIALID is documented as being a wrapper over the UuidCreateSequential function (with an endianness reversal layered on top.)

If SQL Server's expecting to deal primarily with UUIDv1 values, the comparison order becomes slightly more logical. Its comparer is comparing the UUIDv1 value's node ID, then the clock sequence, then the time. A UUIDv4 value is filled with mostly random data, so the unusual order that a GUID's bytes were compared in wouldn't make a huge difference to the index fragmentation. The tradeoff comes when it encounters a UUIDv7 value, and SQL Server reads its rand_b field as a node ID...

jogibear9988 · 2024-11-15T14:36:40Z · Author

It was never a problem to create a GUID which works in SQL Server; that's also what UUIDNext provides.
The problem is that you need special code for SQL Server, whereas the other databases work out of the box.
