svn commit: r365836 - head/share/mk

Steffen Nurpmeso steffen at sdaoden.eu
Thu Sep 17 23:16:30 UTC 2020


Warner Losh wrote in
 <CANCZdfqxXn0o2tYTXWpKPTQpc9iHp7DzFH-BSghdy+P9N9PXJg at mail.gmail.com>:
 |On Thu, Sep 17, 2020, 11:25 AM Jessica Clarke <jrtc27 at freebsd.org> wrote:
 |>> On 17 Sep 2020, at 18:23, Jessica Clarke <jrtc27 at freebsd.org> wrote:
 |>>
 |>>> On 17 Sep 2020, at 18:05, Rodney W. Grimes <freebsd at gndrsh.dnsmgr.net>
 |> wrote:
 |>>>
 |>>>> On Thu, Sep 17, 2020 at 9:39 AM Steffen Nurpmeso <steffen at sdaoden.eu>
 |> wrote:
 |>>>>
 |>>>>> Alex Richardson wrote in
 |>>>>> <202009171507.08HF7Qns080555 at repo.freebsd.org>:
 |>>>>>|Author: arichardson
 |>>>>>|Date: Thu Sep 17 15:07:25 2020
 |>>>>>|New Revision: 365836
 |>>>>>|URL: https://svnweb.freebsd.org/changeset/base/365836
 |>>>>>|
 |>>>>>|Log:
 |>>>>>|  Stop using lorder and ranlib when building libraries
 |>>>>>|
 |>>>>>|  Use of ranlib or lorder is no longer necessary with current linkers
 |>>>>>|  (probably anything newer than ~1990) and ar's ability to create an
 |>>>>> object
 |>>>>>|  index and symbol table in the archive.
 |>>>>>|  Currently the build system uses lorder+tsort to sort the .o files
 |> in
 |>>>>>|  dependency order so that a single-pass linker can use them.
 |> However,
 |>>>>>|  we can use the -s flag to ar to add an index to the .a file which
 |> makes
 |>>>>>|  lorder unnecessary.
 |>>>>>|  Running ar -s is equivalent to running ranlib afterwards, so we can
 |>>>>> also
 |>>>>>|  skip the ranlib invocation.
 |>>>>>
 |>>>>> That ranlib thing yes (for long indeed), but i have vague memories
 |>>>>> that the tsort/lorder ordering was also meant to keep the things
 |>>>>> which heavily interdepend nearby each other.  (Luckily Linux
 |>>>>> always had at least tsort available.)
 |>>>>> This no longer matters for all the platforms FreeBSD supports?
 |>>>>>
 |>>>>
 |>>>> tsort has no notion of how dependent the modules are, just an order
 |> that
 |>>>> allows a single pass through the .a file (otherwise you'd need to list
 |> the
 |>>>> .a file multiple times on the command line absent ranlib). That's the
 |>>>> original purpose of tsort. tsort, lsort, and ranlib all arrived in 7th
 |>>>> edition unix on a PDP-11, where size was more important than proximity
 |> to
 |>>>> locations (modulo overlays, which this doesn't affect at all).
 |>>>>
 |>>>> There were some issues of long vs short jumps on earlier architectures
 |> that
 |>>>> this helped (since you could only jump 16MB, for example). However,
 |> there
 |>>>> were workarounds for this issue on those platforms too. And if you
 |> have a
 |>>>> program that this does make a difference, then you can still use
 |>>>> tsort/lorder. They are still in the system.
 |>>>>
 |>>>> I doubt you could measure a difference here today. I doubt, honestly,
 |> that
 |>>>> anybody will notice at all.
 |>>>
 |>>> The x86 archicture has relative jmps of differning lengths, even in
 |> long mode
 |>>> there is support for rel8 and rel32.
 |>>
 |>> That's irrelevant though for several reasons:
 |>>
 |>> 1. The compiler has already decided on what jump instructions to use
 |> based on
 |>>   the requested code model (unless you're on RISC-V and using GNU bfd ld
 |> as
 |>>   that supports linker relaxations that actually delete instruction
 |> bytes).
 |>>
 |>> 2. The linker is still free to reorder input sections however it likes,
 |> it
 |>>   doesn't have to follow the order of the input files (and the files
 |> within
 |>>   any archive).
 |>
 |> Hm actually that's only true for archives; it needs to respect the \
 |> order of
 |> files on the command line for things like crti.o to work. But regardless,
 |> the
 |> other points (and this one, partially) still hold.
 |>
 |>> 3. If you care about those kinds of optimisations you should use
 |> link-time
 |>>   optimisation which will likely do far more useful things than just
 |> optimise
 |>>   branches, but again isn't constrained by the order of the input files,
 |> it
 |>>   can lay out the code exactly how it wants.
 |>>
 |>> Not to mention that this is just a topological sort, not a clustering
 |> sort.
 |>
 |
 |Yea. I doubt you'd be able to measure a difference on anything in our tree.

Very interesting, thank you all.
Profiling based sort order, impressive even.  I thought more about
cache hotness and, simply, keeping interdependent things together
as such.  But well, caches are so big today, and everything is
dynamically linked, i only jerk a bit due to that runtime cost
myself.

--steffen
|
|Der Kragenbaer,                The moon bear,
|der holt sich munter           he cheerfully and one by one
|einen nach dem anderen runter  wa.ks himself off
|(By Robert Gernhardt)


More information about the svn-src-head mailing list