[patch] Shark reroute LLVM atomic intrinsics to Zero

Gary Benson gbenson at redhat.com
Tue Apr 7 02:51:01 PDT 2009


Matthew Flaschen wrote:
> Gary Benson wrote:
> > Having said that, is there any reason to use atomic intrinsics on a
> > single processor system?  And, do multiprocessor ARM systems exist?
> 
> I think the answers are yes, and definitely yes.  Even with a single
> core, a thread can still be preempted.  And there are definitely
> multi-processor ARM
> (http://www.arm.com/products/CPUs/ARMCortex-A9_MPCore.html).

Ok, so there's no way around this.

I'm thinking is that ARM is a special case here: you could implement
SharkBuilder::CreateCmpxchg* on ARM by inlining arm_compare_and_swap
from atomic_linux_zero.inline.hpp.  That wouldn't require any LLVM
modifications, and it wouldn't be any slower than a "real"
implementation of llvm.atomic.cmp.swap.i32 because either way the
atomic part is handled by the call to __kernel_cmpxchg so the
generated code would be the same.

Obviously it would be preferable for this code to be in LLVM rather
than in Shark, but I know that implementing it in LLVM won't be quick
or easy so I can live with a bit of clutter in Shark for this.

What I'm not happy with is the idea of a generic solution that would
perform a function call on platforms that don't require one, because
that _would_ be slower than an implementation of llvm.atomic.cmp.swap.
For platforms that have instructions to do this I'm going to insist
people bite the bullet and implement the intrinsic.

Cheers,
Gary

-- 
http://gbenson.net/



More information about the distro-pkg-dev mailing list