amrd disk performance drop after running under high load
Krassimir Slavchev
krassi at bulinfo.net
Tue Oct 16 02:15:19 PDT 2007
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Alexey Popov wrote:
> Hi.
>
> Kris Kennaway wrote:
>
>>> After some time of running under high load disk performance become
>>> expremely poor. At that periods 'systat -vm 1' shows something like
>>> this:
>> What does "high load" mean? You need to explain the system workload
>> more.
> This web service is similiar to YouTube. This server is video store. I
> have around 200G of *.flv (flash video) files on the server.
>
> I run lighttpd as a web server. Disk load is usually around 50%, network
> output 100Mbit/s, 100 simultaneous connections. CPU is mostly idle.
>
> As you can see it is a trivial service - sending files to network via HTTP.
>
>>
>>> Disks amrd0
>>> KB/t 85.39
>>> tps 5
>>> MB/s 0.38
>>> % busy 99
>>
>>> Apart of all, I tried to make mutex profiling and here's the results
>>> (sorted by the total number of acquisitions):
>>>
>>> Bad case:
>>>
>>> 102 223514 273977 0 14689 1651568 /usr/src/sys/vm/uma_core.c:2349 (512)
>>> 950 263099 273968 0 15004 14427 /usr/src/sys/vm/uma_core.c:2450 (512)
>>> 108 150422 175840 0 10978 22988519 /usr/src/sys/vm/uma_core.c:1888
>>> (mbuf)
>>
>> > Here you can see that high UMA activity happens in periods of low disk
>> > performance. But I'm not sure whether this is a root of the
>> problem, not
>> > a consequence.
>>
>> The extremely high contention there does seem to say you have a mbuf
>> starvation problem and not a disk problem. I don't know why this
>> would be happening off-hand.
> But there's no mbuf shortage in `netstat -m`.
>
> What else can I try to track down the source of the problem?
>
>> Can you also provide more details about the system hardware and
>> configuration?
> This is Dell 2850 2 x Xeon 3.2, 4Gb RAM, 6x300Gb SCSI RAID5. I'll attach
> details.
>
> With best regards,
> Alexey Popov
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> freebsd-stable at freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscribe at freebsd.org"
last pid: 11008; load averages: 0.07, 0.10, 0.08 up 47+08:32:50
11:46:15
38 processes: 1 running, 37 sleeping
Mem: 46M Active, 3443M Inact, 246M Wired, 144M Cache, 208M Buf, 5596K Free
Swap: 2048M Total, 4K Used, 2048M Free
PID USERNAME THR PRI NICE SIZE RES STATE C TIME WCPU COMMAND
56386 root 1 4 0 19856K 10000K kqread 1 115:19 2.88% lighttpd
636 root 1 96 0 18292K 4212K select 0 25:39 0.00% snmpd
784 root 1 96 0 19668K 2072K select 1 2:31 0.00% sshd
680 root 1 96 0 7732K 1384K select 0 1:59 0.00% ntpd
1540 root 1 96 0 35092K 6496K select 0 1:30 0.00% httpd
769 root 4 20 0 14148K 2632K kserel 0 1:04 0.00% bacula-fd
755 root 1 96 0 3852K 1060K select 1 0:22 0.00% master
568 root 1 96 0 3648K 908K select 0 0:18 0.00% syslogd
80663 root 1 8 0 3688K 1016K nanslp 1 0:05 0.00% cron
760 postfix 1 96 0 3944K 1160K select 0 0:04 0.00% qmgr
89776 www 1 20 0 35180K 6684K lockf 0 0:04 0.00% httpd
89763 www 1 20 0 35180K 6684K lockf 0 0:04 0.00% httpd
89774 www 1 20 0 35180K 6684K lockf 0 0:04 0.00% httpd
89775 www 1 96 0 35180K 6684K select 0 0:04 0.00% httpd
699 root 1 20 0 7732K 1388K pause 0 0:03 0.00% ntpd
484 root 1 4 0 652K 220K select 0 0:00 0.00% devd
10904 llp 1 96 0 30616K 3564K select 0 0:00 0.00% sshd
10915 root 1 20 0 3912K 2340K pause 1 0:00 0.00% csh
You run apache with mod_perl or php too. How many clients handle this
apache server? Also in this light load you have locked files! Check
script execution times (/server-status may be useful).
When you have hight load check swap usage and haw many processes are in
lockf state.
Best Regards
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (FreeBSD)
iD8DBQFHFHuVxJBWvpalMpkRAnqTAJ9FgURNk98dtD0HYX6xIz17R6sLpQCgh5nJ
XBtfOyzJJbkjzVzSF/WfmHc=
=oTHZ
-----END PGP SIGNATURE-----
More information about the freebsd-hackers
mailing list