kern/161483: net / [carp] [patch] when preemption is enabled carp interface assumes MASTERship immediately even with higher advbase/advskew

Damien Fleuriot dam at my.gd
Tue Oct 11 10:00:21 UTC 2011


>Number:         161483
>Category:       kern
>Synopsis:       net / [carp] [patch] when preemption is enabled carp interface assumes MASTERship immediately even with higher advbase/advskew
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    freebsd-bugs
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Tue Oct 11 10:00:19 UTC 2011
>Closed-Date:
>Last-Modified:
>Originator:     Damien Fleuriot
>Release:        applies to freebsd 8.x , 9.x
>Organization:
Hi-Media
>Environment:
irrelevant, problem still exists in SRC files for FreeBSD 9.0-BETA3
>Description:
Refer: my post on -stable
http://docs.freebsd.org/cgi/getmsg.cgi?fetch=368260+0+current/freebsd-stable

Refer the same problem on 8.x
http://www.freebsd.org/cgi/query-pr.cgi?pr=161123

Under normal operating circumstances, a CARP interface goes through the following states:
- INIT : when it's down
- BACKUP : immediately upon being brought up, the interface assumes a BACKUP role and starts its timer to know if it should claim mastership.
- MASTER : if the delay has expired (advbase * 3) without the interface seeing another master, it assumes mastership.


BUG: When preemption is enabled (net.inet.carp.preempt=1) , the CARP interface immediately assumes MASTERship regardless of its advbase and advskew values.

This causes CARP switchovers when a firewall from a CARP cluster is rebooted, for example.

In our case, this actually led to lost client connections, lost database sessions, developers' daemons crashes because of lost java/db connections...



This is a known problem with OpenBSD 3.8 and lower's implementation of CARP.
This has been fixed as of OpenBSD 3.9.
>How-To-Repeat:
Set up 2 boxes with a shared CARP IP.
Enable CARP preemption.

Bring down your CARP interface on the BACKUP box.
Bring it up again.
Notice how your interface assumed MASTERship for a short time.
Check with dmesg which confirms that your box actually preempted.
>Fix:
Remove all the code portion from sys/netinet/ip_carp.c that handles the INIT->MASTER transition immediately if preemption is enabled.

Patch attached.


Patch attached with submission follows:

--- sys/netinet/ip_carp.c       2011-09-24 18:10:38.000000000 +0200
+++ sys/netinet/ip_carp.c       2011-10-11 10:11:27.000000000 +0200
@@ -1424,24 +1424,10 @@
 
        switch (sc->sc_state) {
        case INIT:
-               if (carp_opts[CARPCTL_PREEMPT] && !carp_suppress_preempt) {
-                       carp_send_ad_locked(sc);
-#ifdef INET
-                       carp_send_arp(sc);
-#endif
-#ifdef INET6
-                       carp_send_na(sc);
-#endif /* INET6 */
-                       CARP_LOG("%s: INIT -> MASTER (preempting)\n",
-                           SC2IFP(sc)->if_xname);
-                       carp_set_state(sc, MASTER);
-                       carp_setroute(sc, RTM_ADD);
-               } else {
-                       CARP_LOG("%s: INIT -> BACKUP\n", SC2IFP(sc)->if_xname);
-                       carp_set_state(sc, BACKUP);
-                       carp_setroute(sc, RTM_DELETE);
-                       carp_setrun(sc, 0);
-               }
+               CARP_LOG("%s: INIT -> BACKUP\n", SC2IFP(sc)->if_xname);
+               carp_set_state(sc, BACKUP);
+               carp_setroute(sc, RTM_DELETE);
+               carp_setrun(sc, 0);
                break;
        case BACKUP:
                callout_stop(&sc->sc_ad_tmo);


>Release-Note:
>Audit-Trail:
>Unformatted:


More information about the freebsd-bugs mailing list