---
date: 2012-03-19
title: Highly Available NFS/NAS
category: linux
---

Take two CentOS servers (nfs1 and nfs2 will do nicely) and install
ELRepo and EPEL on them both:

    yum install \
        https://download.fedoraproject.org/pub/epel/6/i386/epel-release-6-5.noarch.rpm \
        https://elrepo.org/elrepo-release-6-4.el6.elrepo.noarch.rpm --nogpgcheck

Each of them should ideally have two NICs, with the secondary ones used
just for DRBD sync purposes. We'll give these the addresses 10.0.0.1/24
and 10.0.0.2/24.

I am also assuming that you have disabled the firewall and SELinux -- I
do not recommend that for production, but for testing it is fine.
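
As a sketch, on CentOS 6 you would give the sync NIC its static address
via an ifcfg file (assuming the second interface is called eth1; the
name may differ on your hardware). On nfs1,
/etc/sysconfig/network-scripts/ifcfg-eth1 would look something like:

    # dedicated DRBD sync link
    DEVICE=eth1
    BOOTPROTO=none
    ONBOOT=yes
    IPADDR=10.0.0.1
    NETMASK=255.255.255.0

Use IPADDR=10.0.0.2 on nfs2.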

# DRBD Configuration

Install DRBD 8.4 on both nodes:

    yum install drbd84-utils kmod-drbd84

On each node the file /etc/drbd.d/global\_common.conf should contain:

    global {
        usage-count yes;
    }
    common {
        net {
            protocol C;
        }
    }

and /etc/drbd.d/main.res should contain:

    resource main {
        on nfs1 {
            device    /dev/drbd0;
            disk      /dev/sdb;
            address   10.0.0.1:7788;
            meta-disk internal;
        }
        on nfs2 {
            device    /dev/drbd0;
            disk      /dev/sdb;
            address   10.0.0.2:7788;
            meta-disk internal;
        }
    }
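
Before going further you can sanity-check the configuration syntax on
each node; drbdadm will parse the config files and print the resource
back at you, complaining about anything malformed:

    drbdadm dump main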

On both nodes you will need to create the resource metadata:

    drbdadm create-md main

and start the daemons:

    service drbd start
    chkconfig drbd on

Now `service drbd status` will give you:

    drbd driver loaded OK; device status:
    version: 8.4.1 (api:1/proto:86-100)
    GIT-hash: 91b4c048c1a0e06777b5f65d312b38d47abaea80 build by dag@Build64R6, 2011-12-21 06:08:50
    m:res   cs         ro                   ds                         p  mounted  fstype
    0:main  Connected  Secondary/Secondary  Inconsistent/Inconsistent  C

Both devices are Secondary and Inconsistent; this is normal at this
stage. Choose a node to be your primary and run:

    drbdadm primary --force main

And it will start syncing, which will take a long time. You can
temporarily make it faster with (on one node):

    drbdadm disk-options --resync-rate=110M main

Put it back again with `drbdadm adjust main`.
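
You can keep an eye on the sync progress from /proc/drbd, which shows
the percentage done and an estimated time to finish:

    watch cat /proc/drbd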

On your primary node you can now create a filesystem. I'm using ext4
for no good reason other than it being the default. Use whatever you
are most comfortable with.

    mkfs.ext4 /dev/drbd0

# Configure NFS

If you did a minimal CentOS install, then you will need to install the
nfs-utils package (yum install nfs-utils). Prepare your mount points and
exports on both servers:

    mkdir /drbd
    echo "/drbd/main *(rw)" >> /etc/exports

Now we do the actual NFS set up. We previously chose nfs1 as our master
when we used it to trigger the initial sync. On nfs1 mount the
replicated volume, move the NFS state directory onto it, then create a
symlink to the replicated data.

    mount /dev/drbd0 /drbd
    mkdir /drbd/main
    mv /var/lib/nfs/ /drbd/
    ln -s /drbd/nfs/ /var/lib/nfs
    umount /drbd

If you get errors about not being able to remove directories in
/var/lib/nfs, do not worry.

Now a little preparation on nfs2:

    mv /var/lib/nfs /var/lib/nfs.bak
    ln -s /drbd/nfs/ /var/lib/nfs

This will create a broken symbolic link, but it will be fixed when
everything fails over.

# Heartbeat Configuration

Heartbeat is in the EPEL repository, so enable that and install it on
both nodes:

    yum -y install heartbeat

Make sure that */etc/ha.d/ha.cf* contains:

    keepalive 2
    deadtime 30
    bcast eth0
    node nfs1 nfs2

The values in `node` should be whatever `uname -n` returns on each
host.
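
If you are unsure, check on both machines:

    uname -n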

Now create `/etc/ha.d/haresources`:

    nfs1 IPaddr::10.0.0.100/24/eth0 drbddisk::main Filesystem::/dev/drbd0::/drbd::ext4 nfslock nfs

That is a little cryptic, so I'll explain: nfs1 is the primary node.
IPaddr sets up a floating address on eth0 that our clients will connect
to. This has a resource drbddisk::main bound to it, which sets our main
resource to primary on nfs1. Filesystem mounts /dev/drbd0 at /drbd on
nfs1. Finally, the services nfslock and nfs are started on nfs1.

Finally, it needs an authentication file in /etc/ha.d/authkeys, which
should be chmod'ed to 600 so that it is only readable by root:

    auth 3
    3 md5 mypassword123
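
Set the permissions accordingly:

    chmod 600 /etc/ha.d/authkeys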

You should also make sure that nfslock and nfs do not start up by
themselves:

    chkconfig nfs off
    chkconfig nfslock off

Now you can start heartbeat and make sure it comes back at boot:

    service heartbeat start
    chkconfig heartbeat on

# Testing

Running `ifconfig` on nfs1 should give you something like:

    eth0      Link encap:Ethernet  HWaddr 52:54:00:84:73:BD
              inet addr:10.0.0.1  Bcast:10.0.0.255  Mask:255.255.255.0
              inet6 addr: fe80::5054:ff:fe84:73bd/64 Scope:Link
              UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
              RX packets:881922 errors:0 dropped:0 overruns:0 frame:0
              TX packets:1302012 errors:0 dropped:0 overruns:0 carrier:0
              collisions:0 txqueuelen:1000
              RX bytes:239440621 (228.3 MiB)  TX bytes:5791818459 (5.3 GiB)

    eth0:0    Link encap:Ethernet  HWaddr 52:54:00:84:73:BD
              inet addr:10.0.0.100  Bcast:10.0.0.255  Mask:255.255.255.0
              UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

    lo        Link encap:Local Loopback
              inet addr:127.0.0.1  Mask:255.0.0.0
              inet6 addr: ::1/128 Scope:Host
              UP LOOPBACK RUNNING  MTU:16436  Metric:1
              RX packets:2 errors:0 dropped:0 overruns:0 frame:0
              TX packets:2 errors:0 dropped:0 overruns:0 carrier:0
              collisions:0 txqueuelen:0
              RX bytes:224 (224.0 b)  TX bytes:224 (224.0 b)

Note an entry for `eth0:0` has miraculously appeared.

Also `df` should include the entry:

    /dev/drbd0            20G  172M   19G   1% /drbd

Reboot nfs1 and the services should appear on nfs2.

Connect an NFS client to your floating address (10.0.0.100) and you
should be able to kill the live node and it will carry on.
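
For example, from a client (assuming /mnt/nfs exists as a mount point):

    mount -t nfs 10.0.0.100:/drbd/main /mnt/nfs

Write a file, power off the active node, and once heartbeat's deadtime
expires the mount should carry on working against the surviving node.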
|
Loading…
Add table
Add a link
Reference in a new issue