Skip to content

Commit

Permalink
feat: upgrade to v2.0.0 with enhanced nanoid functions and optimizations
Browse files Browse the repository at this point in the history
  • Loading branch information
nik-sta authored Sep 1, 2023
1 parent b7d710b commit 70cdeae
Show file tree
Hide file tree
Showing 6 changed files with 327 additions and 46 deletions.
135 changes: 116 additions & 19 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,68 +11,165 @@ A tiny, secure, URL-friendly, unique string ID generator for Postgres.
* **Small.** Just a simple Postgres function.
* **Safe.** It uses pgcrypto random generator. Can be used in clusters.
* **Short IDs.** It uses a larger alphabet than UUID (`A-Za-z0-9_-`). So ID size was reduced from 36 to 21 symbols.
* **Portable**. Nano ID was ported to [over 20 programming languages](https://github.com/ai/nanoid/blob/main/README.md#other-programming-languages).
* **Portable**. Nano ID was ported
to [over 20 programming languages](https://github.com/ai/nanoid/blob/main/README.md#other-programming-languages).

## How to use

## Use
```sql
SELECT nanoid(); -- creates an id, with the defaults of the created nanoid() function.
SELECT nanoid(); -- Simplest way to use this function. Creates an id, with the defaults of the created nanoid() function.
SELECT nanoid(4); -- size parameter set to return 4 digit ids only
SELECT nanoid(3, 'abcdefghij'); -- custom size and alphabet parameters defined. nanoid() generates ids concerning them
SELECT nanoid(3, 'abcdefghij'); -- custom size and alphabet parameters defined. nanoid() generates ids concerning them.
SELECT nanoid(10, '23456789abcdefghijklmnopqrstuvwxyz', 1.85); -- nanoid() could generates ids more performant with a custom defined additional bytes factor.
```

```sql
CREATE TABLE mytable (
id char(21) DEFAULT nanoid() PRIMARY KEY
CREATE TABLE mytable(
id char(21) DEFAULT nanoid() PRIMARY KEY
);

or

-- To use a custom id size
CREATE TABLE mytable (
id char(14) DEFAULT nanoid(14) PRIMARY KEY
CREATE TABLE mytable(
id char(14) DEFAULT nanoid(14) PRIMARY KEY
);

or

-- To use a custom id size and a custom alphabet
CREATE TABLE mytable (
id char(12) DEFAULT nanoid(12, 'ABC123') PRIMARY KEY
CREATE TABLE mytable(
id char(12) DEFAULT nanoid(12, 'ABC123') PRIMARY KEY
);
```

## Getting Started

### Requirements

* PostgreSQL 9.4 or newer

Execute the file `nanoid.sql` to create the `nanoid()` function on your defined schema. The nanoid() function will only be available in the specific database where you run the SQL code provided.
Execute the file `nanoid.sql` to create the `nanoid()` function on your defined schema. The nanoid() function will only
be available in the specific database where you run the SQL code provided.

**Manually create the function in each database:** You can connect to each database and create the function. This function can be created manually or through a script if you have many databases. Remember to manage updates to the function. If you change the function in one database, those changes will only be reflected in the other databases if you update each function.
**Manually create the function in each database:** You can connect to each database and create the function. This
function can be created manually or through a script if you have many databases. Remember to manage updates to the
function. If you change the function in one database, those changes will only be reflected in the other databases if you
update each function.

### Adding to the default template database
## Adding to the default template database

**Use a template database:** If you often create databases that need to have the same set of functions, you could create a template database that includes these functions. Then, when you create a new database, you can specify this template, and PostgreSQL will make the new database a copy of the template.
**Use a template database:** If you often create databases that need to have the same set of functions, you could create
a template database that includes these functions. Then, when you create a new database, you can specify this template,
and PostgreSQL will make the new database a copy of the template.

Here's how to do that:

1. Connect to template1 database:
2. Then, run your nanoid() function creation code.

*If the function is only needed for specific applications, it might be better to create it manually in each database where needed or create a custom template database that includes this function and use that template when creating new databases for these applications.*
*If the function is only needed for specific applications, it might be better to create it manually in each database
where needed or create a custom template database that includes this function and use that template when creating new
databases for these applications.*

Also, note that changes to template1 won't affect existing databases, only new ones created after the changes. Existing databases will need to have the function added manually if required.
Also, note that changes to template1 won't affect existing databases, only new ones created after the changes. Existing
databases will need to have the function added manually if required.

Reference: [Template Databases](https://www.postgresql.org/docs/current/manage-ag-templatedbs.html)

## Calculating the additional bytes factor for a custom alphabet

If you change the alphabet of the `nanoid()` function, you could optimize the performance by calculating a new additional
bytes factor with the following SQL statement:

```sql
WITH input as (SELECT '23456789abcdefghijklmnopqrstuvwxyz' as alphabet)
SELECT round(1 + abs((((2 << cast(floor(log(length(alphabet) - 1) / log(2)) as int)) - 1) - length(alphabet)::numeric) / length(alphabet)), 2) as "Optimal additional bytes factor"
FROM input;

-- The resulting value can then be used f.e. as follows:
SELECT nanoid(10, '23456789abcdefghijklmnopqrstuvwxyz', 1.85);

```

Utilizing a custom-calculated additional bytes factor in `nanoid()` enhances string generation performance. This factor
determines how many bytes are generated in a single batch, optimizing computational efficiency. Generating an optimal number
of bytes per batch minimizes redundant operations and conserves memory.

## Usage Guide: `nanoid_optimized()`

The `nanoid_optimized()` function is an advanced version of the `nanoid()` function designed for higher performance and
lower memory overhead. While it provides a more efficient mechanism to generate unique identifiers, it assumes that you
know precisely how you want to use it.

🚫 **Warning**: No checks are performed inside `nanoid_optimized()`. Use it only if you're sure about the parameters you'
re passing.

### Function Signature

```sql
nanoid_optimized(
size int,
alphabet text,
mask int,
step int
) RETURNS text;
```

### Parameters

- `size`: The desired length of the generated string.
- `alphabet`: The set of characters to choose from for generating the string.
- `mask`: The mask used for mapping random bytes to alphabet indices. The value should be `(2^n) - 1`, where `n` is a
power of 2 less than or equal to the alphabet size.
- `step`: The number of random bytes to generate in each iteration. A larger value might speed up the function but will
also increase memory usage.

### Example Usage

Generate a NanoId String of length 10 using the default alphabet set:

```sql
SELECT nanoid_optimized(10, '_-0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ', 63, 16);
```

### Tips:

- **Performance**: This function is optimized for performance, so it's ideal for scenarios where high-speed ID
generation is needed.
- **Alphabet Set**: The larger your alphabet set, the more unique your generated IDs will be, but also consider the mask
and step parameters' adjustments.
- **Customization**: Feel free to adjust the parameters to suit your specific needs, but always be cautious about the
values you're inputting.

By following this guide, you can seamlessly integrate the `nanoid_optimized()` function into your projects and enjoy the
benefits of its optimized performance.

## Using MySQL/MariaDB?

If you're using MySQL or MariaDB and you found this library helpful, we have a similar library for MySQL/MariaDB, too! Check out our [Nano ID for MySQL/MariaDB](https://github.com/viascom/nanoid-mysql-mariadb) repository to use the same capabilities in your MySQL/MariaDB databases.
If you're using MySQL or MariaDB and you found this library helpful, we have a similar library for MySQL/MariaDB, too!
Check out our [Nano ID for MySQL/MariaDB](https://github.com/viascom/nanoid-mysql-mariadb) repository to use the same
capabilities in your MySQL/MariaDB databases.

## 🌱 Contributors Welcome

- 🐛 **Encountered a Bug?** Let us know with an issue. Every bit of feedback helps enhance the project.

- 💡 **Interested in Contributing Code?** Simply fork and submit a pull request. Every contribution, no matter its size, is valued.

- 📣 **Have Some Ideas?** We're always open to suggestions. Initiate an issue for discussions or to share your insights.

All relevant details about the project can be found in this README.

Your active participation 🤝 is a cornerstone of **nanoid-postgres**. Thank you for joining us on this journey.

## Authors 🖥️
## 🖥️ Authors

* **Patrick Bösch** - *Initial work* - [itsmefox](https://github.com/itsmefox)
* **Nikola Stanković** - *Initial work* - [nik-sta](https://github.com/nik-sta)

See also the list of [contributors](https://github.com/viascom/nanoid-postgres/contributors) who participated in this project. 💕
See also the list of [contributors](https://github.com/viascom/nanoid-postgres/contributors) who participated in this
project. 💕

## License

Expand Down
1 change: 1 addition & 0 deletions dev/benchmark/a_nanoid.sql
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
-- PLACE IMPLEMENTATION A 'a_nanoid()' HERE
1 change: 1 addition & 0 deletions dev/benchmark/b_nanoid.sql
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
-- PLACE IMPLEMENTATION B 'b_nanoid()' HERE
60 changes: 60 additions & 0 deletions dev/benchmark/benchmarks.sql
Original file line number Diff line number Diff line change
@@ -0,0 +1,60 @@
DO
$$
DECLARE
startTime timestamp;
endTime timestamp;
durationA interval;
durationB interval;
numLoops int := 100000;
counter int;
dummyResult text;
BEGIN

-- Benchmarking A nanoid()
RAISE NOTICE '-----------------------------';
RAISE NOTICE 'Starting benchmark for A nanoid() for % loops...', numLoops;
startTime := clock_timestamp();
FOR counter IN 1..numLoops
LOOP
dummyResult := a_nanoid();
dummyResult := a_nanoid(5, '23456789abcdefghijklmnopqrstuvwxyz');
dummyResult := a_nanoid(11, '_-0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ.,');
dummyResult := a_nanoid(48, '0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ');
END LOOP;
endTime := clock_timestamp();
durationA := endTime - startTime;
RAISE NOTICE 'A nanoid() took %', durationA;
RAISE NOTICE '-----------------------------';

-- Benchmarking B nanoid()
RAISE NOTICE 'Starting benchmark for B nanoid() for % loops...', numLoops;
startTime := clock_timestamp();
FOR counter IN 1..numLoops
LOOP
dummyResult := b_nanoid();
dummyResult := b_nanoid(5, '23456789abcdefghijklmnopqrstuvwxyz');
dummyResult := b_nanoid(11, '_-0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ.,');
dummyResult := b_nanoid(48, '0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ');
END LOOP;
endTime := clock_timestamp();
durationB := endTime - startTime;
RAISE NOTICE 'B nanoid() took %', durationB;
RAISE NOTICE '-----------------------------';

-- Compare
IF durationA < durationB THEN
RAISE NOTICE 'A nanoid() is faster by %', durationB - durationA;
ELSIF durationA > durationB THEN
RAISE NOTICE 'B nanoid() is faster by %', durationA - durationB;
ELSE
RAISE NOTICE 'Both functions have comparable performance.';
END IF;

RAISE NOTICE '-----------------------------';

END
$$;

-- TODO:
-- EXTRACT(EPOCH FROM (timestamp1 - timestamp2))
-- ROUND()
73 changes: 73 additions & 0 deletions dev/test/unit_tests.sql
Original file line number Diff line number Diff line change
@@ -0,0 +1,73 @@
DO
$$
DECLARE
generated_id text;
counter int;
numLoops int := 1000;
BEGIN
-- Default parameters
FOR counter IN 1..numLoops
LOOP
generated_id := nanoid();
RAISE NOTICE '%', generated_id;
ASSERT LENGTH(generated_id) = 21, 'Default nanoid length is incorrect';
ASSERT generated_id ~ '^[-_a-zA-Z0-9]*$', 'Default nanoid contains invalid characters';
END LOOP;

-- Size 12, default alphabet
FOR counter IN 1..numLoops
LOOP
generated_id := nanoid(12);
RAISE NOTICE '%', generated_id;
ASSERT LENGTH(generated_id) = 12, 'Size 12 nanoid length is incorrect';
ASSERT generated_id ~ '^[-_a-zA-Z0-9]*$', 'Size 12 nanoid contains invalid characters';
END LOOP;

-- Size 25, default alphabet
FOR counter IN 1..numLoops
LOOP
generated_id := nanoid(25);
RAISE NOTICE '%', generated_id;
ASSERT LENGTH(generated_id) = 25, 'Size 25 nanoid length is incorrect';
ASSERT generated_id ~ '^[-_a-zA-Z0-9]*$', 'Size 25 nanoid contains invalid characters';
END LOOP;

-- Default size (21), custom alphabet (only lowercase)
FOR counter IN 1..numLoops
LOOP
generated_id := nanoid(21, 'abcdefghijklmnopqrstuvwxyz');
RAISE NOTICE '%', generated_id;
ASSERT LENGTH(generated_id) = 21, 'Size 21 (only lowercase) nanoid length is incorrect';
ASSERT generated_id ~ '^[a-z]*$', 'Size 21 (only lowercase) nanoid contains invalid characters';
END LOOP;

-- Size 15, custom alphabet (only numbers)
FOR counter IN 1..numLoops
LOOP
generated_id := nanoid(15, '0123456789');
RAISE NOTICE '%', generated_id;
ASSERT LENGTH(generated_id) = 15, 'Size 15 (only numbers) nanoid length is incorrect';
ASSERT generated_id ~ '^[0-9]*$', 'Size 15 (only numbers) nanoid contains invalid characters';
END LOOP;

-- Size 17, custom alphabet (uppercase + numbers)
FOR counter IN 1..numLoops
LOOP
generated_id := nanoid(17, 'ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789');
RAISE NOTICE '%', generated_id;
ASSERT LENGTH(generated_id) = 17, 'Size 17 (uppercase + numbers) nanoid length is incorrect';
ASSERT generated_id ~ '^[A-Z0-9]*$', 'Size 17 (uppercase + numbers) nanoid contains invalid characters';
END LOOP;

-- -- Intentional false positive: use default size but with a mismatched regex pattern
-- FOR counter IN 1..numLoops
-- LOOP
-- generated_id := nanoid();
-- RAISE NOTICE '%', generated_id;
-- -- This will fail because we're purposefully using a wrong pattern
-- ASSERT generated_id ~ '^[XYZ]*$', 'Intentional false positive detected';
-- END LOOP;

RAISE NOTICE 'All tests passed successfully!';
END
$$;
Loading

0 comments on commit 70cdeae

Please sign in to comment.