YAHB is a deduplicating file copy tool, intended for backup use. Deduplication works on the file-level with NTFS hardlinks.
The latest release is available HERE.
YAHB is also available via winget. Simply open a command prompt and run
winget install asdfjkl.YAHB
Consider the following scenario: Your have a folder
C:\MyFiles
for which you want to create backups. Assume for simplicity that the folder contains only two files:
C:\MyFiles\movie.avi (huge 600 MB movie file)
C:\MyFiles\todo.txt (your todo-list, few kilobytes)
The large movie.avi
doesn't change. Your todo.txt
is changed almost daily, but it's only a very small file. Let's further assume, it's March 1st, 2019, the current time is 15:12, and you are creating backup with YAHB to F:\Backup
. Then YAHB will simply copy C:\MyFiles
as follows:
F:\Backup\201903011512\C__\MyFiles\movie.avi
F:\Backup\201903011512\C__\MyFiles\todo.txt
Suppose the next day (March 2nd, same time) you want to create another backup to the same location. The file todo.txt
has changed inbetween, but the file movie.avi
has not. YAHB will locate the last previous backup folder, and identify those files that changed, and those that didn't. Running YAHB again will result in the following backup:
F:\Backup\201903021512\C__\MyFiles\movie.avi -> hardlink to F:\Backup\201903011512\C__\MyFiles\movie.avi
F:\Backup\201903021512\C__\MyFiles\todo.txt
The folder F:\Backup\201903021512
now only takes a few kilobytes, instead of 600 MB, since movie.avi
is only stored once on the drive F:
, but two NTFS hardlinks are pointing to it.
Moreover:
- If at some point, you decide to delete the folder
F:\Backup\201903011512
(but keepF:\Backup\201903021512
), NTFS will detect that there is a hardlink pointing tomovie.avi
. It will delete the folder, but keepmovie.avi
on the disk. Same for the other way round. - You always have a 1:1 copy of your current files at hand. In case of a desaster, there is no proprietary backup format to extract from, re-order your file structure etc. In the above example, just copy the latest version of
MyFiles
back, and all your data are there - maximum recoverability. - If a file is currently locked (i.e. opened for read/write), YAHB supports to still create a copy of that file using Windows Volume Shadow Copy Service. This is useful, if you want to create a backup in the background while working with the computer, i.e. creating backups of documents while you have them still open in Word/LibreOffice, or creating a backup of your Thunderbird or Firefox Profile folder, while still writing mails or browsing the web.
Just unzip to a folder, open a command-prompt and run yahb
.
YAHB is currently 64 bit only. YAHB will likely run fine on Windows 7 and 8.1, but only Windows 10 is supported.
- When copying to a locally attached drive, the target drive MUST be NTFS-formatted. Otherwise hardlinks cannot be created.
- When copying to a network share, things are more complicated. Basically the underlying file system must support hardlinks, and must expose hardlink creation in such a way, that Windows API commands can be used to create hardlinks. This is supported with i.e. SAMBA when Unix Extensions are enabled. Fortunately, most typical NAS solutions like Synology or QNAP suport this and work out-of-the-box.
YAHB requires Microsoft NET Framework 4.7.2 or higher. The following versions of Windows ship with suitable versions of NET Framwework by default, i.e. you don't need to install anything if you run:
- Windows 10, version 1809 and later
If you are running an earlier version of Windows, download and install the latest Microsoft NET Framework here.
Only if you want to make use of Windows Volume Shadow Copy Service to copy files currently in use, you need to additionally install Microsoft Visual Studio C++ 2019 Redistributable. You need the version for 64bit systems, i.e. vc_redist.x64.exe
. Note that it's very likely that this is already installed on your system by other programs.
Windows via default has a MAX_PATH
restriction, i.e. can't handle path names longer than 260 characters - a relict from old MS-DOS times.
Since YAHB keeps the original folder structure but in addition adds a timestamp and drive letter -- like e.g. F:\Backup\201903021512\C__\MyFiles
it is possible to run into problems as the destination path is then longer than 260 characters.
There are two possible workarounds:
- keep the maximal path length in mind and if required shorten folder names prior to creating a backup.
- For YAHB version 1.0.5 or later: Windows 10, version 1607 and later are able to remove the
MAX_PATH
restriction via a registry entry. LocateHKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\FileSystem
and look for an entry calledLongPathsEnabled
. Change that value to1
if it isn't already set to1
.
Note: When backing up to a network drive (i.e. SAMBA), it is unclear if long paths (> 260 characters) work in practice with YAHB 1.0.5. and the above mentioned registry entry. Feedback is appreciated!
You can support the development of YAHB by donating via Paypal.
Note: To use the option /vss
you MUST run YAHB with elevated rights, i.e. from an elevated command prompt (Run as Administrator
).
YAHB (Yet Another Hardlink-based Backup-Tool)
Version 1.0.7.0
Copyright (c) 2019 - 2021 Dominik Klein
Syntax:: yahb.exe <source-dir> <target-dir> [<options>]
source-dir:: source directory (i.e. C:\MyFiles)
target-dir:: target directory (i.e. D:\Backups)
TYPICAL EXAMPLE:
yahb c:\MyFiles d:\Backup /s /xf:*.tmp
will copy all files and the directory structure from c:\MyFiles
to d:\Backup\YYYYMMDDHHMM, including all subdirectories. Yahb will
also look for previous backups of c:\MyFiles in d:\Backup, and if
a file has not changed, it will create a hardlink to that location.
Moreover, all files with ending .tmp will be skipped.
OPTIONS
/copyall :: copy ALL files. Otherwise the following directory
patterns and file types are excluded:
DIRECTORIES:
- 'System Volume Information'
- 'AppData\Local\Temp'
- 'AppData\Local\Microsoft\Windows\INetCache'
- 'C:\Windows'
- '$Recycle.Bin'
FILES AND PLACEHOLDERS:
- hiberfil.sys
- pagefile.sys
- swapfile.sys
- *.~
- *.temp
/files:PAT1;PAT2;... :: copy only files that match the supplied
file patterns (like *.exe)
/help :: display this help screen
/list :: do not copy anything, just list all files
/log:FILENAME :: write all output (log) to a textfile FILNAME.
If FILENAME exists, it will be overwritten
/+log:FILENAME :: same as /log:FILENAME, but always append, i.e.
do not not overwrite FILENAME if it exists.
/pause :: after finishing, wait for the user to press
ENTER before closing the program. This
prevents a command - prompt from vanishing
after finishing if run e.g. by Windows' RUNAS
command
/s :: also copy all SUBDIRECTORIES of <source-dir>
/tee :: even if /log:FILENAME or /+log:FILENAME is
chosen, still write everything additionally
to console output.
/verbose :: by default, only the progress and errors
are output to the console/log. In verbose
mode, all created files and directories
are listed - note that for large copy
operations, this frequent output to console
will slow down the overal operation
/vss :: If a file is currently in use, and cannot be
accessed, try to still copy that file by using
Windows' Volume Shadow Copy Service.
YOU NEED TO RUN YAHB WITH ELEVATED (ADMIN)
RIGHTS FOR THIS TO WORK.
/xd:DIR1;DIR2;... :: eXclude directories dir1, dir2, and so forth.
I.e. if DIR is provided here, any (full)
directory path that contains DIR is skipped
/xf:PAT1;PAT2;... :: eXclude files with filename PAT1, PAT2 and so
forth. PAT can also be a file pattern like *.tmp
/? :: display this help screen