Backup strategy for Seafile CE

Sam · March 8, 2017, 10:47am

I have been running Seafile for 2 years now. Until now I have backed up the full virtual machine (Hyper-V) to keep the data safe.

I have another server to backup now, where the server is running on bare metall instead of VM.
This is my thoughts so far:

Stop Seafile
Backup MySQL to another server
Backup Seafile DATA-folder to another server using Rsync to minimize data-transfer and keep a second identical copy.
Start Seafile

This gives me no backup-history, which I would like to have…

I have some ideas… Storing the copy on a ZFS volume with snapshots. It can give me the history feature, but it will require a ZFS pool-setup.

How are you dealing with this? Please advice me!
Thanks!

//Sam

Sam · March 10, 2017, 8:04am

Ping! Any thoughts?..

mholoubek · March 10, 2017, 9:48am

Hi

There are many options and missing information

What amount of data are we talking about ?
What connection do u have between the Seafile server and the remote backup server ?
… backup history … how many days/weeks/… ?
What kind of backup storage do you have (filesystem, amount of storage,…) ?

Rsync full backups and/or Rsnapshot are, without any fancy storage backend (ZFS, Ceph,…), the only practical solution i can think of. If you have e.g. FreeNAS as backup storage, the deduplication/compression would help to keep the amount of data low.

Michael

Lonsarg · March 14, 2017, 7:46pm

Well at the moment i live with full backups of virtual disk on which seafile data resides. It means i lose history of few months, if server goes down, for home its ok.

I already have a plan on how to do proper daily LIVE incremental backups, but am to lazy to do it. I would use btrfs, since it uses WAY less resources than ZFS and is stable enough for most features(RAID56 is not stable yet for example, but normal raids and snapshots are stable).

I would use btrfs filesystem for seafile server which can at any time make crash-consistent snapshot of harddrive(including sql databases). And than you can, while server continues operating, read from this snapshot to any incremental backup program. It can be rsync, another btrfs backup drive with snapshot capability, ZFS, Ceph,… I personally for home usage would use another btrfs for simplicity.

A step up for more serious backup would be seperate sql dump that would happen about 30-60 seconds BEFORE data snapshotting takes place(since we want all reacords in sql to have a matching record on disk, we do not care if a few orphaned files stay on disk, they do no harm). So proper sql backup + crash-consistend seafile data backup.

Or also a very good idea if taking server offline for a few seconds would work for you:
Take server offline, make a btrfs snapshot and turn server back on. This way you do not need any seperate sql backuping, plus instead of crash-consistent data you have a fully offline backup with offline time of server less than 30 seconds with a script!

Garfield · March 15, 2017, 7:12am

That sounds pretty good. Do this with cron at the mid of the night when nobody is using it (especially in home/single user environment).

troed · March 16, 2017, 12:19pm

I use the seaf-fuse filesystem and Crashplan. That way I have version controlled backups of all files in Seafile and I’m not reliant on Seafile internal formats if I ever need to rescue any files. This is all done live.

Jack · March 16, 2017, 12:39pm

But this doesn’t work with encrypted libraries, does it?

troed · March 16, 2017, 12:58pm

That’s probably correct (I assume it is) - on my system we don’t have any encrypted libraries.

saljut7 · March 16, 2017, 3:55pm

https://manual.seafile.com/extension/fuse.html:

Encrypted folders can’t be accessed by seaf-fuse.

Sam · March 20, 2017, 1:19pm

Thanks for your thoughts here.

Maybe I didn’t give so detailed information. I’m just looking for a general solution, and want to know how others are dealing with the backup strategies on Seafile CE.

I have the backupserver attached to the local network by gigabit interface.
As Seafile have history and recycle-bin, the backup are mainly for disaster recovery (not user accidents). Daily backup are enough.

As seaf-fuse are working on another way, I will indeed recover the files stored in Seafile, but I will lose all Seafile “metadata”. Good as last resort, but not my preferred way.

Regarding the BTRFS - proposed by @Lonsarg. Should I store both mySQL and the Seafile data-directory on a disk with this filesystem. Also, will it require Seafile to be stoped during snapshots? Anyone that have this solution up and running successfully? This seems to require the Backup server to also use a snapshot capable filesystem for simplification of a incremental backup / restore.

Thanks for more discussion on this topic!
Until I have a good “on the guset backup” I will back the VM up from the host… It consumes more disk, time but are a secure way…

//Sam

mercury · March 20, 2017, 6:21pm

I’ve been dealing with this since CE v. 1.X and pretty much follow the sequence proposed in the original post. The servers I see are on Ubuntu and generally serve under 25 users. I can expect to see encrypted libraries among users.

For what it’s worth, I have had one major event in the time I’ve been using and managing Seafile CE that involved a corrupted MySQL server after an upgrade. This was during the Seafile 2.X days. I have not had a problem with Seafile itself (fingers crossed) although I do the minor upgrades religiously shortly after they come out.

For the sake of the new-comer to Seafile, I’d recommend a periodic full backup using Redo or Clonezilla, Redo being drop-dead easy to use. In your case you seem to suggest you backup your VM file which gets you to the same place.

As for scripted solutions you might want to reference some posts on the German Seafile site too. It’s now called SyncWerk but was formerly Seafile.de (I think). I seem to recall some discussions there like these (in English):

https://forum.syncwerk.com/t/seafile-terminal-client-autostart-backup-routine/1580

https://forum.syncwerk.com/t/my-backup-scripts/3607

You can take bits and pieces as needed.

I think if I had to build an industrial-strength system I would base storage on ZFS. I have not done this yet but I might try a ZFS-based system running a VM or Docker instance of Seafile on FreeNAS or TrueNAS (iXsystems). RAIDZ2 and Snapshots could be very useful. In addition, in case seaf-fuse offers any advantages in your system, the server could be used for file storage as well.

I have not tried backup servers like Bacula but would like to. It seems like another steep learning curve but one worth doing as an R&D project on the side. By the way, FreeNAS/TrueNAS does run Bacula as a plug-in, I believe.

Hope this helps.

Sam · June 19, 2018, 1:24pm

Time for a bump of my old thread!

I have been using complete VM bacukp until now. But I have another setup on physical hardware. I need a good backupsolution here.
I have some ideas:

Backup SQL DB
Backup Seafile Data
Rsync to another location
I would like to have some kind of retention policy: (keep nightly for 7 days, weekly for 4 weeks, monthly for 3 months)…

Ideas / scripts?..

bionade24 · June 19, 2018, 3:48pm

If you have a powerful server, you don’t have to stop ZFS for making snaphots. I rsync the snapshots after it. But don’t use btrfs, it’s beta for over 5 years now and Red Hat gave it up to make it stable. Just use it if your system isn’t powerful enough for ZFS.

Sam · June 21, 2018, 8:54am

Thanks!

I would prefere not to use ZFS for this as it require a high-end server as a backup-storage.

Other ideas?..

Thanks!

shoeper · June 21, 2018, 9:09am

Why is that? Ok … I had the issue that it wasn’t (isn’t?) available for arm, but I wouldn’t say using ZFS requires a high-end server.

Sam · June 21, 2018, 1:10pm

Okay, I have only tried ZFS in FreeNAS and Nas4Free. FreeNAS requires high-end.
But I’m maybe not to familiar with ZFS.
Is it the best solution? ZFS on Ubuntu?
//Sam

bionade24 · June 21, 2018, 4:59pm

There are two options: ZFS on FUSE and ZFS on Linux. ZFS on FUSE is horrible. Actually, the performance on BSD should be much better than on Linux, but I nver used FreeBSD with ZFS. Of course, if you don’t have 16GB RAM or more you have to limit the usage of ZFS. But you can use it like a normal system if you let it use swap.

shoeper · June 21, 2018, 7:31pm

ZoL (ZFS on Linux) works pretty well for me since 2014.

Sam · June 25, 2018, 8:06pm

Okay, It seems to be a good idea to let the ZFS file-system deal with the backup-history instead of doing some own retention-policy based file-hierarchy.
But if I cannot have a Linux ZFS based backup-machine?.. Are there any other options? Just want to be sure to investigate different paths before a decision.

Thanks!

//Sam

Simsala · June 25, 2018, 8:36pm

There are already a lot of good ideas at the forum, please also have a look at:

Backup Strategy seafile CE 4TB of data