amd64 kernel dynamic linking allows extern references to statics

Jan Mikkelsen janm at transactionware.com
Wed Jul 15 23:18:24 UTC 2015


> On 15 Jul 2015, at 11:27 pm, Konstantin Belousov <kostikbel at gmail.com> wrote:
> 
> On Wed, Jul 15, 2015 at 06:17:20PM +1000, Jan Mikkelsen wrote:
>> Hi,
>> 
>> (All on 10.2-BETA1.)
>> 
>> I noticed that the latest patch in the bug https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=187594 works on amd64 but fails to load zfs.ko on i386 with a symbol not found error.
>> 
>> Looking at the patch, there is one file that has “extern int zio_use_uma” to reference a variable that is declared elsewhere as “static int zio_use_uma”. To me this obviously should not work. However it does work on amd64 but fails on i386.
>> 
>> Below is a small test case that reproduces the problem. The generated kernel module loads on amd64 but fails on i386. On amd64 one compilation unit is accessing a static from another compilation unit by declaring the variable “extern”.
>> 
>> I haven’t looked further to attempt to find the bug. However, it looks like a Bad Thing™ to me.
>> 
> 
> I am not sure that this is fixable.  Issue is that amd64 modules are
> relinked object files, and they might have unresolved relocations against
> local symbols.  Change like the following probably fix your test case,
> but also quite possible would break legitimate local references.
> 
> diff --git a/sys/kern/link_elf_obj.c b/sys/kern/link_elf_obj.c
> index 021381d..6fa5276 100644
> --- a/sys/kern/link_elf_obj.c
> +++ b/sys/kern/link_elf_obj.c
> @@ -1096,7 +1096,8 @@ link_elf_lookup_symbol(linker_file_t lf, const char *name, c_linker_sym_t *sym)
> 
> 	for (i = 0, symp = ef->ddbsymtab; i < ef->ddbsymcnt; i++, symp++) {
> 		strp = ef->ddbstrtab + symp->st_name;
> -		if (symp->st_shndx != SHN_UNDEF && strcmp(name, strp) == 0) {
> +		if (symp->st_shndx != SHN_UNDEF && strcmp(name, strp) == 0 &&
> +		    ELF_ST_BIND(symp->st_info) != STB_LOCAL) {
> 			*sym = (c_linker_sym_t) symp;
> 			return 0;
> 		}

I don’t know why there could be an unresolved relocation against a local symbol. My (admittedly trivial) tests with nm and static functions/variables have not led to a “U” record. Under what circumstances would this happen?

Separately, this behaviour also defeats multiple-definition checking at link time. Modifying my test slightly to have one compilation unit with “static int testvar”, one compilation unit with “int testvar” and one with “extern int testvar” gives nm output like this from the .ko file:

00000000000000d0 d testvar
00000000000000c0 d testvar

It is unclear which instance of testvar the “extern” declaration used for resolution. Simple testing seems to show it was the one with the non-static storage class. However, a piece of code depending on the previous resolution to local names could break by the addition of a file with a name clash. Also, note that there is no “U” record: the “extern int testvar” declaration has already been resolved at this point.

Making both definitions static gives this:

                 U testvar
00000000000000c0 d testvar
00000000000000d0 d testvar

Which testvar will I get at load time? Who knows? Adding a file with a static variable with a name clash in another compilation unit can change the behaviour of a module.
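
To make the ambiguity concrete, here is a minimal sketch in C of a first-match, name-only symbol lookup, modelled loosely on the loop in the quoted patch. Everything in it is a hypothetical simplification: sketch_sym, sketch_lookup and two_statics are illustrative names, not the kernel's structures, and the order of the entries in the table is an assumption, since nm output does not tell us the symbol table order.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-ins for ELF symbol fields; not the kernel's types. */
enum sketch_bind { SKETCH_LOCAL, SKETCH_GLOBAL };

struct sketch_sym {
	const char *name;
	unsigned long value;
	enum sketch_bind bind;
	int defined;		/* 0 models st_shndx == SHN_UNDEF */
};

/*
 * Return the first defined symbol whose name matches.  When skip_local
 * is nonzero, local symbols are ignored, mirroring the extra binding
 * check added by the quoted patch.
 */
static const struct sketch_sym *
sketch_lookup(const struct sketch_sym *tab, size_t n, const char *name,
    int skip_local)
{
	size_t i;

	assert(tab != NULL && name != NULL);
	for (i = 0; i < n; i++) {
		if (!tab[i].defined || strcmp(tab[i].name, name) != 0)
			continue;
		if (skip_local && tab[i].bind == SKETCH_LOCAL)
			continue;
		return (&tab[i]);
	}
	return (NULL);
}

/* Models the "both definitions static" case; entry order is assumed. */
static const struct sketch_sym two_statics[] = {
	{ "testvar", 0xc0, SKETCH_LOCAL, 1 },
	{ "testvar", 0xd0, SKETCH_LOCAL, 1 },
	{ "testvar", 0, SKETCH_GLOBAL, 0 },	/* the "U" record */
};
```

In this sketch, sketch_lookup(two_statics, 3, "testvar", 0) returns whichever static happens to come first in the table, so the result depends purely on ordering. With skip_local set to 1 the same call returns NULL, which would correspond to a symbol not found error at load time.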

The big question for me is: under which legitimate case is this feature necessary? The downsides seem significant.

Regards,

Jan.



More information about the freebsd-stable mailing list