Fresh installed Freebsd 9 don't boot from hd

Andriy Gapon avg at FreeBSD.org
Sun Oct 23 17:58:05 UTC 2011


on 23/10/2011 18:27 Dennis Koegel said the following:
> On Fri, Oct 21, 2011 at 04:33:38PM -0400, John Baldwin wrote:
>> Working offline with Dennis, we found that changing the CFLAGS in 
>> sys/boot/i386/gptboot/Makefile from "-O1" to "-Os -mrtd" (partially reverting 
>> an earlier commit) fixed gptboot.  The next test for someone to do would be to 
>> try just adding "-mrtd" and leaving "-O1" as-is to see if that fixes it.
> 
> More test results:
> 
> gcc -Os -fno-guess-branch-probability -fomit-frame-pointer -fno-unit-at-a-time \
> 	-mno-align-long-strings -mrtd [from before r225530]: Boots OK
> gcc -Os -mrtd: Boots OK
> gcc -O1 -mrtd: Fails
> gcc -O1: Fails
> gcc -O0: Fails
> gcc -Os: Boots OK
> 
> clang -O1: Fails
> clang -Os: Fails
> clang -Oz: Fails
> 
> I've put some printf()s into gpt{,boot}.c to trace where the reboot is
> triggered. It appears to be in drvsize() (called from gptread()). OTOH
> the debug output may have changed where the problem occurs, I don't
> know about that.
> 
> With 9.0R drawing near, CFLAGS should be s/-O1/-Os/, until we can figure
> out what happens. But as for why gcc's magic -Os is required and clang's
> output doesn't work at all, I'm clueless.

Thank you for your very valuable analysis!
I looked at a difference in assembly code of the drvsize function produced by
gcc -Os and by gcc -O1.  One thing that was immediately obvious is that gcc
places the params array and the sectors variable in a different order for
different options.  One idea is that if BIOS actually writes beyond the end of
the array, then in one case it could be harmless (overwrites the sector
variable), but in the other case it could be more harmful.
I found a document that suggests a possibility of BIOS writing more bytes to the
array than its current size of 0x42:
http://www.t13.org/documents/UploadedDocuments/docs2008/e08134r1-BIOS_Enhanced_Disk_Drive_Services_4.0.pdf

Of course, the size of the array is passed to BIOS at the start of the array and
so a _non-buggy_ BIOS should not write beyond the array, but we live in a
non-perfect world.

Could you please test this hypothesis by trying the following patch?
diff --git a/sys/boot/i386/common/drv.c b/sys/boot/i386/common/drv.c
index 11f6628..5996a80 100644
--- a/sys/boot/i386/common/drv.c
+++ b/sys/boot/i386/common/drv.c
@@ -37,10 +37,10 @@ __FBSDID("$FreeBSD$");
 uint64_t
 drvsize(struct dsk *dskp)
 {
-	unsigned char params[0x42];
+	unsigned char params[0x4A];
 	uint64_t sectors;

-	*(uint32_t *)params = sizeof(params);
+	*(uint16_t *)params = sizeof(params);

 	v86.ctl = V86_FLAGS;
 	v86.addr = 0x13;



P.S. the assembly diff to which I referred above:
--- drvsize.Os.S	2011-10-23 20:17:56.871996966 +0300
+++ drvsize.O1.S	2011-10-23 20:18:27.430995560 +0300
@@ -4,8 +4,8 @@
 	pushl	%ebp
 	movl	%esp, %ebp
 	subl	$76, %esp
-	leal	-74(%ebp), %ecx
-	movl	$66, -74(%ebp)
+	leal	-66(%ebp), %ecx
+	movl	$66, -66(%ebp)
 	movl	$262144, __v86
 	movl	$19, __v86+4
 	movl	$18432, __v86+24
@@ -28,20 +28,20 @@
 	pushl	%eax
 	pushl	$.LC4
 	call	printf
-	xorl	%eax, %eax
-	xorl	%edx, %edx
-	popl	%ecx
-	popl	%ecx
+	movl	$0, %eax
+	movl	$0, %edx
+	addl	$8, %esp
 	jmp	.L16
+	.p2align 2,,3
 .L14:
 	pushl	$8
-	leal	-58(%ebp), %eax
+	leal	-50(%ebp), %eax
 	pushl	%eax
-	leal	-8(%ebp), %eax
+	leal	-76(%ebp), %eax
 	pushl	%eax
 	call	memcpy
-	movl	-8(%ebp), %eax
-	movl	-4(%ebp), %edx
+	movl	-76(%ebp), %eax
+	movl	-72(%ebp), %edx
 	addl	$12, %esp
 .L16:
 	leave

-- 
Andriy Gapon


More information about the freebsd-current mailing list