Basics of rclone

Stairway to the Cloud

Article from Issue 187/2016
Author(s):

As a simple and reliable command-line backup utility that supports popular cloud storage services, rclone makes a perfect tool for maintaining an off-site backup of your data. This article can help you to get started.

Cloud storage is cheap nowadays, and you have plenty of storage providers to choose from. So, you have no excuse for not having an off-site backup system to keep your files safe. There is a fly in the ointment, however. Many cloud storage services want you to use their own proprietary graphical client applications. Worse still, some services don't provide Linux clients at all. Fortunately, there is rclone, a small open source utility that can talk to many popular cloud storage services, including Google Drive, Amazon S3, and hubiC. Additionally, rclone can handle local filesystems, so you can use it for local backup, too. The utility is straightforward in use, so there is no learning curve to speak of.

Deploying rclone

Written in Go, rclone is distributed as a self-contained binary file with no dependencies, and it will happily run on the x86, AMD64, and ARM platforms. Installing rclone is not difficult, but it does require a bit of manual work. Grab the latest release of the tool for the appropriate Linux platform from the project's website [1]. Unpack the downloaded archive and switch to the resulting directory in the terminal. Then, copy the binary executable to the /usr/local/bin/ directory and change the file's permissions:

sudo cp rclone /usr/local/bin/
sudo chown root:root /usr/local/bin/rclone
sudo chmod 755 /usr/local/bin/rclone

To install man pages containing rclone's documentation, run the following commands:

sudo mkdir -p /usr/local/share/man/man1
sudo cp rclone.1 /usr/local/share/man/man1/
sudo mandb

That's all there is to it. Alternatively, you can compile rclone from source, which is also simple to do. First, make sure that the Go programming language is installed on your system. On Debian and Ubuntu, you can install Go by running the apt-get install golang command as root. Create the ~/go directory, and point $GOPATH to it using the export GOPATH=$HOME/go command. Then, run the following command to download and compile rclone:

go get github.com/ncw/rclone

Once the operation is finished, you'll find the compiled binary file in the ~/go/bin directory. Copy the rclone file and change its permissions as described above.

Configuring rclone

To make rclone work with your preferred cloud storage service, you need to create a configuration file. The utility includes the special config subcommand that allows you to do just that. For example, suppose you want to use rclone with the hubiC service [2]. Run the rclone config command, press n when prompted to set a configuration password, and name the configuration profile remote. Then, enter the number corresponding to the hubiC option in the list of supported services. Leave the Hubic Client Id and Hubic Client Secret options empty and select the auto-configuration option. This will open the default browser and obtain an access token. Check and confirm the generated settings, and you are done.

Rclone uses a default browser for authenticating with cloud storage services, which works well on any system with a graphical desktop environment. But, what if you want to configure rclone on a remote machine without a graphical desktop? It can be done, but you still need another machine with a browser. Here is how it works.

Run the rclone config command and configure the options as described above until you reach the Use auto config? prompt. Press n and note the exact rsync authorize command (in the case of hubiC, the command is rsync authorize "hubic"). Run this command on the machine with a browser, copy the obtained access token, and paste it into the result > prompt on the remote machine.

If you've already configured rclone on a local machine, you can simply copy the .rclone.conf file to the remote machine. Usually, the configuration file is stored in the home directory (i.e., ~/.rclone.conf), but you can find the exact path by running the rclone -h command; The actual path to the configuration file will be shown next to the --config option.

Once rclone has been configured, run the rclone lsd remote: command, which returns all containers. Listing 1 shows the output of this command in the case of hubiC.

Listing 1

Output of rclone lsd remote

01     7639189 0001-01-01 00:00:00        21 default
02           0 0001-01-01 00:00:00         0 default_segments
03
04 Transferred:            0 Bytes (   0.00 kByte/s)
05 Errors:                 0
06 Checks:                 0
07 Transferred:            0
08 Elapsed time:       600ms

Using rclone

The rclone utility supports a few simple subcommands and options that allow you to access, manage, and use the remote storage; the copy and sync subcommands are probably the most important among them. As the name suggests, the copy subcommand copies the contents of the source directory to the remote destination.

This subcommand doesn't transfer unchanged files (it checks them by size, modification time, and md5sum hashes), and it doesn't delete files from the destination directory. If the destination directory doesn't exist, rclone automatically creates it. Most rclone commands have the simple rclone [OPTION] [SUBCOMMAND] <source> <destination> syntax, and here is what the command that copies the contents of a specified directory to the remote destination looks like:

rclone copy /path/to/source remote:destination

Similar to copy, the sync subcommand transfers files from the source directory to the destination, skipping unchanged files. When sync encounters files that don't exist in the source directory, the subcommand deletes them from the destination. In other words, sync keeps both source and destination directories in sync by modifying the destination. Because this operation (as well as some other rclone actions) is irreversible, it makes sense to test it first, for which rclone provides the handy --dry-run option. Add this option to the rclone sync command to check what files will be copied and deleted:

rclone --dry-run sync /path/to/source remote:destination

When using rclone, keep in mind that it copies and syncs the contents of directories and not the directories themselves. So, the rclone sync /home/user/Documents remote:Backup command copies the contents of the Documents directory (and not the directory and files in it) to the remote Backup directory.

The check subcommand can come in handy when you need to ensure that the files in the source and destination directories match:

rclone check /path/to/source remote:destination

This command compares files by their sizes and md5sum hashes and then shows a list of files that don't match.

Besides copy and sync, rclone supports several subcommands that let you view and manage remote storage. The lsd subcommand, for example, can be used to list all directories (also called containers and buckets) in the remote destination, and the ls subcommand shows all files in a specified remote directory:

rclone lsd remote:
rclone ls remote:dir

If you need to create or delete a remote directory, you can use the mkdir and rmdir subcommands for that:

rclone mkdir remote:new_dir
rclone rmdir remote:old_dir

The last command can remove a directory only if it's empty. If you want to delete a directory and its contents, use the purge subcommand:

rclone purge remote:old_dir

As a command-line tool, rclone supports a number of options that control its behavior. The --bwlimit option, for example, lets you limit the bandwidth available to rclone. This can be useful when the machine running rclone shares the Internet connections with other clients. Limiting the bandwidth ensures that other machines can access the Internet at a reasonable speed during the copy or sync operations. The bandwidth limit can be specified in kilobytes, megabytes, or gigabytes using the k, M, and G suffixes:

rclone --bwlimit=15M sync /path/to/source remote:destination

The command above limits the bandwidth to 15MBps.

By default, rclone runs four simultaneous file transfer operations, and you can adjust this number using the --transfers option. If you have a fast connection and remote storage service, you can increase the number of transfers:

rclone --transfers=7 copy /path/to/source remote:destination

Conversely, you might want to reduce the number of transfers if the remote service frequently times out or your Internet connection is on the slow side.

The --dry-run option mentioned previously allows you to test rclone operations without applying any changes. Finally, if you run rclone unattended, you might want to use the --log-file option as shown here

rclone --log-file=rclone.log sync /path/to/source remote:destination

to save rclone's output to a file for later reference.

Buy this article as PDF

Express-Checkout as PDF
Price $2.95
(incl. VAT)

Buy Linux Magazine

SINGLE ISSUES
 
SUBSCRIPTIONS
 
TABLET & SMARTPHONE APPS
Get it on Google Play

US / Canada

Get it on Google Play

UK / Australia

Related content

  • The sys admin's daily grind: Rclone

    Having a good backup is a matter of course for sys admin columnist Charly Kühnast, but devices could still fall victim to fire or theft some day. Because he has enough free space on Google Drive, he doesn't need to search long for a solution. The only thing missing is the right tool.

  • Google Drive File Sync with rclone
  • Website Backup

    Shared hosting is the best way for first-time webmasters to get started. But what do you do about backup?

  • FOSSPicks

    Graham nearly made it through an entire month of FOSSPicks without an esoteric audio discovery. And then he found VeeSeeVSTRack.

  • Light Browsing

    This Chromium port can run inside any console, with minimal resources, and is a great tool for making old computers really useful – and learning programming along the way.

comments powered by Disqus