Why is thread local storage so slow?

Posted by dsimcha on Stack Overflow
Published on 2009-02-03T05:28:37Z Indexed on 2010/03/20 10:51 UTC

Filed under: d, multithreading, performance

I'm working on a custom mark-release style memory allocator for the D programming language that works by allocating from thread-local regions. The thread-local storage (TLS) lookup appears to be a bottleneck: allocating from these regions is roughly 50% slower than in an otherwise identical single-threaded version of the code, even after designing my code to do only one TLS lookup per allocation/deallocation. This figure comes from allocating and freeing memory a large number of times in a loop, and I'm trying to figure out whether it's an artifact of my benchmarking method. My understanding is that thread-local storage should basically just involve accessing something through an extra layer of indirection, similar to accessing a variable via a pointer. Is this incorrect? How much overhead does thread-local storage typically have?

Note: Although I mention D, I'm also interested in general answers that aren't specific to D, since D's implementation of thread-local storage will likely improve if it is slower than the best implementations.
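For readers who want something concrete to experiment with, here is a minimal C++ sketch of the kind of design described above. It is not the original D code: `Region`, `thread_region`, `allocate_lookup_per_call`, and `allocate_lookup_hoisted` are hypothetical names, and the 1 MiB capacity and 64-byte request size are arbitrary assumptions. The two functions do the same work but differ in how often they go through the thread-local accessor, which is one way to check whether the TLS lookup itself accounts for the slowdown.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical mark-release region (illustrative only, not the poster's code).
class Region {
public:
    explicit Region(std::size_t capacity) : buffer_(capacity) {}

    // Bump-allocate `size` bytes; returns nullptr when the region is exhausted.
    void* allocate(std::size_t size) {
        const std::size_t remaining = buffer_.size() - top_;
        if (size > remaining) return nullptr;
        // Round the request up to a multiple of 16 so offsets stay 16-byte multiples.
        const std::size_t rounded = (size + 15) & ~static_cast<std::size_t>(15);
        if (rounded > remaining) return nullptr;
        void* p = buffer_.data() + top_;
        top_ += rounded;
        return p;
    }

    // Remember the current top of the region.
    std::size_t mark() const { return top_; }

    // Free everything allocated since `marker` was taken.
    void release(std::size_t marker) {
        assert(marker <= top_);
        top_ = marker;
    }

    std::size_t used() const { return top_; }

private:
    std::vector<unsigned char> buffer_;
    std::size_t top_ = 0;
};

// One region per thread, constructed on that thread's first call.
// The 1 MiB capacity is an arbitrary assumption.
Region& thread_region() {
    thread_local Region region(std::size_t{1} << 20);
    return region;
}

// Variant A: goes through the thread-local accessor on every allocation.
std::size_t allocate_lookup_per_call(std::size_t count) {
    const std::size_t marker = thread_region().mark();
    std::size_t succeeded = 0;
    for (std::size_t i = 0; i < count; ++i) {
        if (thread_region().allocate(64) != nullptr) ++succeeded;
    }
    thread_region().release(marker);
    return succeeded;
}

// Variant B: looks the region up once and reuses the reference.
std::size_t allocate_lookup_hoisted(std::size_t count) {
    Region& region = thread_region();  // the only TLS access in this function
    const std::size_t marker = region.mark();
    std::size_t succeeded = 0;
    for (std::size_t i = 0; i < count; ++i) {
        if (region.allocate(64) != nullptr) ++succeeded;
    }
    region.release(marker);
    return succeeded;
}
```

Timing both variants over the same `count` separates the cost of the lookup from the cost of the allocator. Two caveats apply when reading such numbers: a function-local `thread_local` object with a constructor typically carries an initialization check on each access in addition to the address computation, and an optimizing compiler may hoist the lookup out of the loop on its own, in which case the two variants will time the same.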
