POSIX Option Groups

Barriers are one of the most recent extensions to the thread functionality. Rendezvous are of part of algorithms performing calculations in multiple threads.

pthread_barrier_destroy()
pthread_barrier_init()
pthread_barrier_wait()
pthread_barrierattr_destroy()
pthread_barrierattr_init()

The thread library on Linux implements these functions completely since version 2.2. There is as of this writing no other non-embedded OS with support for these interfaces.

_POSIX_THREAD_PROCESS_SHARED

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_CHOWN_RESTRICTED`

Description:

If this option is defined changing the owner of filesystem objects is restricted. Normally only the super user is allowed to do this.

Affected interfaces:

chown()
lchown()

Status:

Linux defined this option since day one.

This option is mandatory in IEEE 1003.1-2001 and later.

`_POSIX_CLOCK_SELECTION`

Description:

This options lumps together three different groups of functions. Basically, they all add support for clocks other than the normal realtime clock. The latter is what is used implicitly everywhere time is measured unless explicitly stated otherwise. But this clock might have limitations such as precision.

The pthread_condattr_getclock() and pthread_condattr_setclock() allow to specify the clock which is used in pthread_cond_timedwait(). clock_nanosleep() is a superset of nanosleep(). It allows using alternative clocks but it also allows to specify absolute timeouts. clock_settime() is the interface to set any of the clocks which can be set.

Affected interfaces:

pthread_condattr_getclock()
pthread_condattr_setclock()
clock_nanosleep()
clock_settime()

Status:

Update: The Linux kernel version 2.5.64 and later provide the necessary support. glibc 2.3.3 with NPTL supports this option.

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_CPUTIME`

Description:

Some architectures provide user-level access to a very high-resolution clock the processor implements. Normally this simply a counter of cycles the chip is driven by. To be useful for this option the clock must not turn around often. For this reason Alpha does not have this option available although the architecture has a CPU cycle counter available. But its 32-bit size limit the usefulness.

Other processors, IA-32, IA-64 and perhaps more in future, do have a cycle counter register. The interfaces of this option provide support to use these registers without resorting to architecture specific code. The information is provided in nanoseconds and not in cycle counts.

Warning: Before the 2.6.10 kernel the CPU clock implementation we use might not be what other people want. The clocks basically measured wallclock time using CPU registers. This changed in 2.6.10 when new system calls were added and now scheduling can be taken into account. Now the clocks show the time which the system actually spent on the process/thread. Just like ru_utime/ru_stime info in struct rusage but available at all times.

Affected interfaces:

clock_getcpuclockid()
clock_getres()
clock_gettime()
clock_nanosleep()
clock_settime()
timer_create()

Status:

Before the 2.6.10 kernel some support was available for processors with easy access to CPU cycle counters. For some architectures, like PPC, the great variety of CPU implementations based on the architecture complicate things and nothing has be done to solve the issue.

Update: After 2.6.10 support for all architectures and the correct semantics is available. The precision might vary between the architectures.

`_POSIX_FSYNC`

Description:

This option marks the fsync() interface which might not be useful or implementable on some system but Linux had it forever.

Affected interfaces:

fsync()

Status:

Is implemented since the very early days.

`_POSIX_IPV6`

Description:

This option is signals support for the IPv6 protocol in addition to IPv4. To support IPv6 a number of new interfaces were introduced and some existing interfaces extended. It also means that a number of interfaces which were used with IPv4 cannot be used anymore in protocol-independent code.

Affected interfaces:

accept()
bind()
connect()
freeaddrinfo()
gai_strerror()
getaddrinfo()
gethostbyaddr()
getnameinfo()
getpeername()
getsockname()
getsockopt()
inet_ntop()
inet_pton()
recvfrom()
sento()
setsockopt()

Status:

IPv6 is usable in Linux since at least the Linux 2.4 days.

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_JOB_CONTROL`

Description:

This option is another remembrance of the old days of very simple system. Job control was always available on Linux. Processes could always be sent to the background etc.

Affected interfaces:

setpgid()
tcdrain()
tcflush()
tcgetpgrp()
tcsendbreak()
tcsetattr()
tcsetpgrp()

Status:

Is implemented since the very early days.

This option is mandatory in IEEE 1003.1-2001 and later.

`_POSIX_MAPPED_FILES`

Support for mapping files into the address space is one of the main requirements of shared library implementations. Otherwise the "shared" part couldn't be implemented. Linux got support for this very early on and all kernel versions the current ABI support have this features.

The only possible reason this feature is missing could be if it is deliberately left out which could make sense to strip a kernel down a bit more for the use in embedded systems where shared libraries are often not a requirement.

mmap()
msync()
munmap()

Is implemented since the very early days.

_POSIX_ADVISORY_INFO

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_MEMLOCK`

Description:

Mapping file contents in the address space allows the OS to optimize the handling of the needed memory by loading the data only when really needed and evacuate portions of already loaded data when the memory is needed otherwise. If delay associated with the overhead to load the data on demand is not acceptable the data can be forced to stay in memory. This feature is also available for all currently support kernel versions.

Affected interfaces:

mlockall()
munlockall()

Status:

Is implemented since the very early days.

`_POSIX_MEMLOCK_RANGE`

Description:

A bit more flexible than the functionality of _POSIX_MEMLOCK this options allows to lock parts of a file in memory. This feature is also available in all supported Linux kernel versions.

Affected interfaces:

mlock()
munlock()

Status:

Is implemented since the very early days.

`_POSIX_MEMORY_PROTECTION`

Description:

Changing the access protection of mapped memory regions is useful for many purposes. A reliable implementation of shared libraries requires it and therefore this feature is available in all support kernel versions.

Affected interfaces:

mprotect()

Status:

Is implemented since the very early days.

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_MESSAGE_PASSING`

POSIX message queues are used, similar to the SysV message queues, to pass information between different threads and/or processes. It is often faster to use than pipes and more flexible since multiple producers and consumers can use the same message queue. Plus, using SIGEV_THREAD it is possible to implement a kind of remote procedure call.

mq_close()
mq_getattr()
mq_notify()
mq_open()
mq_receive()
mq_send()
mq_setattr()
mq_unlink()

The implementation of POSIX message queues requires kernel support which got added after 2.6.5 (i.e., 2.6.6 will be the first official kernel with the support). The librt in glibc 2.3.4 after 2004-4-12 includes the necessary userlevel support.

_POSIX_TIMEOUTS

`_POSIX_MONOTONIC_CLOCK`

Description:

The monotonic clock was introduced to allow the user to implement relative timeouts. The problem with the realtime clock which normally is used is that it can be reset with the consequence that timeouts maybe be lengthened or shortened depending on the direction of the clock adjustment. But the availability of this option also means that all interface which normally would use the realtime clock by default now use the monotonic clock which can lead to problem since this is not expected by most of the code written up to this day.

Affected interfaces:

clock_getres()
clock_gettime()
clock_nanosleep()
clock_settime()
timer_create()

Status:

Update: glibc 2.3.3 has support for the functions governed by this option based on support in the 2.5 kernel. Older glibc versions had a userlevel implementation of limited quality and only for CLOCK_REALTIME..

`_POSIX_NO_TRUNC`

Description:: This feature was a bad compromise for some broken systems. Long filenames were silently truncated generating surprising effects and security holes. Fortunately support for this option is now required.
Affected interfaces:: Every interface handling file names.
Status:: Linux always supported this option.

This option is mandatory in IEEE 1003.1-2001 and later.

`_POSIX_PRIORITIZED_IO`

Asynchronous I/O provides the possibility to queue many I/O requests and have them worked on while the program can concentrate on other code. How and when the requests are worked on is up to the implementation to decide. This is sometimes not enough. Important data might have to be preferred. The AIO interface provides a mean to define priorities for I/O requests if this feature is available.

aio_read()
aio_write()

This is a matter-of-quality item on the checklist for the AIO implementation. The current user-level implementation has the necessary support available. The new kernel-level implementation will hopefully also have the needed support.

_POSIX_ASYNCHRONOUS_IO

`_POSIX_PRIORITY_SCHEDULING`

In situations where certain actions have to be performed as fast as possible the scheduling and priority of threads and processes can be changed. This will allow preferring certain threads and processes over others. In embedded systems which react on outside an stimulus and have to perform an action promptly this is important. But also desktop systems benefit, for instance, for video and audio recording and display.

sched_get_priority_max()
sched_get_priority_min()
sched_getparam()
sched_getscheduler()
sched_rr_get_interval()
sched_setparam()
sched_setscheduler()
sched_yield()

If the _POSIX_SPAWN option is defined the following interfaces are available as well:

posix_spawnattr_getschedparam()
posix_spawnattr_getschedpolicy()
posix_spawnattr_setschedparam()
posix_spawnattr_setschedpolicy()

The Linux kernel supports realtime scheduling for many years now.

_POSIX_SPORADIC_SERVER

`_POSIX_RAW_SOCKETS`

Description:

Raw sockets were a disputed socket type when the discussion was made about including them in the POSIX standard. The problem is that not much can be specified generally without consideration of the kind of socket which is manipulated.

Affected interfaces:

getsockopt()
setsockopt()

Status:

The Linux kernel implements raw sockets for all kinds of socket types. The standard behavior is implemented as are many more features.

`_POSIX_READER_WRITER_LOCKS`

Reader-writer locks are a special kind of mutex which allows multiple readers at any one time but only one writer. In situations where the protected data is more often read than written using these mutexes is of benefit.

pthread_rwlock_destroy()
pthread_rwlock_init()
pthread_rwlock_rdlock()
pthread_rwlock_tryrdlock()
pthread_rwlock_trywrlock()
pthread_rwlock_unlock()
pthread_rwlock_wrlock()
pthread_rwlockattr_destroy()
pthread_rwlockattr_init()

Reader-writer locks are implemented in the thread library for many years, even before they were added to the POSIX standard. The implementation provides two versions: a version which prefers readers and one which prefers writers.

_POSIX_THREAD_PROCESS_SHARED

This option is mandatory in IEEE 1003.1-2008. Even in IEEE 1003.1-2001 the functions must always be available if threads are supported.

`_POSIX_REALTIME_SIGNALS`

Description:

Standard Unix signals have several drawbacks. First, they do not queue. If one signal of a kind is pending new ones are simply discarded. Second, the signal cannot carry any information which requires either the use of several different signals to transmit information or some mechanism outside the signal handler has be used (e.g., global variables) which has its own set of problems.

Realtime signals do queue and can transmit information. They also solve problems related to existing signals. E.g., the SIGSEGV can now transmit all the information to locate the reason of the segmentation fault.

Affected interfaces:

sigqueue()
sigtimedwait()
sigwaitinfo()

Status:

Realtime signals are implement for many years and are in wide use.

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_REGEXP`

Description:

Regular expressions were one of the big inventions of the original POSIX standard and although some people prefer other regular expression implementations the POSIX version prevailed in most situations. It provides all the necessary features and is widely available.

Affected interfaces:

regcomp()
regerror()
regexec()
regfree()

Status:

Regular expressions were always available in glibc. The quality of the implementation is another issues. Until glibc 2.3 there were issues with some border cases but internationalization features were available (unlike in most other implementations). Starting with glibc 2.3 a new implementation is available and it should fix the remaining problems with compliance to the standard. Full internationalization support is included as well.

This option is mandatory in IEEE 1003.1-2001 and later.

`_POSIX_SAVED_IDS`

Description:

A process having effective and normal user and group IDs are a mean to increase security. Early POSIX standards didn't require support because some systems at that time didn't provide the feature. The alignment of the latest POSIX standard with FIPS makes this feature mandatory.

Affected interfaces:

geteuid()
getegid()
getgid()
getuid()
seteuid()
setegid()
setgid()
setuid()

Status:

Linux always support saved IDs. The kernel even provides finer-grained IDs which functions which determine the access rights to files.

This option is mandatory in IEEE 1003.1-2001 and later.

`_POSIX_SEMAPHORES`

Semaphores were added to POSIX not as part of the thread package but as a separate set of interfaces. The interface allows easy use of semaphores in different processes by creating named semaphore objects. At the same time anonymous semaphores are available for the use in multi-threaded applications.

sem_close()
sem_destroy()
sem_getvalue()
sem_init()
sem_open()
sem_post()
sem_trywait()
sem_unlink()
sem_wait()

glibc 2.3 with NPTL has full support for semaphores, including named semaphores and inter-process semaphores. Earlier glibc versions have only support for anonymous semaphores.

_POSIX_TIMEOUTS

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_SHARED_MEMORY_OBJECTS`

Description:

POSIX shared memory could theoretically be implemented without special kernel support but the implementation wouldn't be optimized and it would, depending on the setup, be a security problem. The 2.2 series of the kernel introduced a special filesystem type to support shared memory (refined in the 2.4 series) which allowed creating files without backing them with space on a device.

Affected interfaces:

mmap()
munmap()
shm_open()
shm_unlink()

Status:

Starting with glibc 2.1 support for POSIX shared memory was available. But each system must be configured to provide the necessary filesystem.

`_POSIX_SHELL`

Description:

This option was introduced to allow creating profiles for embedded systems which don't need shells and command lines. Non-embedded systems always have a shell.

The POSIX standard requires that system() is usable in multi-threaded applications but this is hardly ever implemented correctly since the implementation has to change global state.

Affected interfaces:

system()

Status:

Normal Linux setups always have a shell and this option defined. glibc 2.3.2 with NPTL support should even implement the multi-threaded application requirement correctly.

This option is mandatory in IEEE 1003.1-2001 and later.

`_POSIX_SPAWN`

Description:

To support multiple processes on systems without MMU support something other than the fork() and exec POSIX model is needed. The solution is the spawn family of functions. The functions can be implemented in the kernel. This way the kernel can avoid the fork() step. For systems which do support fork() the functions can be implemented at user-level.

Affected interfaces:

posix_spawn()
posix_spawn_file_actions_addclose()
posix_spawn_file_actions_adddup2()
posix_spawn_file_actions_addopen()
posix_spawn_file_actions_destroy()
posix_spawn_file_actions_init()
posix_spawnattr_destroy()
posix_spawnattr_getsigdefault()
posix_spawnattr_getflags()
posix_spawnattr_getpgroup()
posix_spawnattr_getsigmask()
posix_spawnattr_init()
posix_spawnattr_setsigdefault()
posix_spawnattr_setflags()
posix_spawnattr_setpgroup()
posix_spawnattr_setsigmask()
posix_spawnp()

Status:

Starting with version 2.2 glibc has a user-level implementation of these interfaces.

`_POSIX_SPIN_LOCKS`

Description:

Spinlocks are a form of synchronization primitive which can be used only in carefully chosen situation. If programs which use threads with different priorities every use of a spinlock in a thread which does not have the highest priority is a gamble. But if spinlocks are usable they provide significant speed advantages.

Affected interfaces:

pthread_spin_destroy()
pthread_spin_init()
pthread_spin_lock()
pthread_spin_trylock()
pthread_spin_unlock()

Status:

Spinlocks are implemented since glibc 2.2. Even the inter-process variant is available since it does not require kernel support.

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_SPORADIC_SERVER`

This option introduces a special scheduling variant for certain situations which seem important enough. The specification is very vague and such an option is of questionable usefulness since not very generic.

sched_getparam()
sched_getscheduler()
sched_setparam()
sched_setscheduler()

The Linux kernel does not implement this scheduling option. No effort to add such support is known.

_POSIX_PRIORITY_SCHEDULING

`_POSIX_SYNCHRONIZED_IO`

Description:

To create a consistent state when it comes to disk I/O it is necessary to force all output from the buffers the kernel uses to the underlying device. With this option the POSIX standard provide a number of different ways to make this possible.

Affected interfaces:

fcntl()
open()
msync()
fdatasync()

Status:

At least with the 2.4 kernels all the necessary support is in place. All the interfaces and functionality required by POSIX are available.

`_POSIX_THREAD_ATTR_STACKADDR`

Description:

Each new thread has to have its own stack and all the stacks of the different threads in a process are in the same address space. Where the stacks are allocated is by default the thread libraries issue. If the use of the address space is an issue, as it sometimes is on 32-bit machine, the application can make the decision by determining the stack address explicitly. The stack is in this case allocated by the user.

Affected interfaces:

pthread_attr_getstack()
pthread_attr_getstackaddr()
pthread_attr_setstack()
pthread_attr_setstackaddr()

Status:

This option is implemented since glibc 2.1 for those architectures which provide a thread register. Since a thread register is a prerequisite for NPTL these functions are always supported for this thread library.

`_POSIX_THREAD_ATTR_STACKSIZE`

Description:

Each new thread has to have its own stack and all the stacks of the different threads in a process are in the same address space. This can create two kinds of problems:

The default stack size is too small. The thread will terminate at some point with an error or it will create wrong results because it reads from and write to invalid memory.
The default stack size is too large. Every stack size is too large if the number of threads is very high.

Affected interfaces:

pthread_attr_getstack()
pthread_attr_getstacksize()
pthread_attr_setstack()
pthread_attr_setstacksize()

Status:

`_POSIX_THREAD_CPUTIME`

This option is similar to the _POSIX_CPUTIME option only that the time starts at zero for each individual thread.

pthread_getcpuclockid()
clock_getres()
clock_gettime()
clock_nanosleep()
clock_settime()
timer_create()

This option is implemented in glibc 2.2. The system must fulfill the same requirements as for the _POSIX_CPUTIME option, namely that the CPU must support a cycle counter register.

_POSIX_CPUTIME

`_POSIX_THREAD_PRIO_INHERIT`

Description:

This option can lead to better handling of threads with different priorities. If a high-priority thread is waiting on a mutex which is held by a lower-priority thread the latter is continuing to use its own priority although it blocks a thread with a higher priority. If this option is defined the user can define a mutex which automatically adjusts the priority of the thread holding the mutex based on the priority of the waiters.

Affected interfaces:

pthread_mutexattr_getprotocol()
pthread_mutexattr_setprotocol()

Status:

The 2.6.18 kernel has the necessary support to implement this option. PI futexes are enabled in (almost?) all configurations. The thread library changes are in version 2.5 which starts to be available in Fedora Core 6 and RHEL5.

`_POSIX_THREAD_PRIO_PROTECT`

Description:

This option can lead to better handling of threads with different priorities. If a high-priority thread is waiting on a mutex which is held by a lower-priority thread the latter is continuing to use its own priority although it blocks a thread with a higher priority. If this option is defined the user can define a mutex which always increases the priority to a given level regardless of whether there are waiters or not.

Affected interfaces:

pthread_mutex_getprioceiling()
pthread_mutex_setprioceiling()
pthread_mutexattr_getprioceiling()
pthread_mutexattr_getprotocol()
pthread_mutexattr_setprioceiling()
pthread_mutexattr_setprotocol()

Status:

This option is implemented for NPTL since August 2006. FC6 and RHEL5 will have this option. It is a userlevel implemention but it is believed to be compliant. The implementation does not support priority protection for robust mutexes. But since robust mutexes are not (yet) part of POSIX this has no effect on this option.

`_POSIX_THREAD_PRIORITY_SCHEDULING`

If this option is defined the different threads inside a process can run with different priorities and/or different schedulers.

pthread_attr_getinheritsched()
pthread_attr_getschedpolicy()
pthread_attr_getscope()
pthread_attr_setinheritsched()
pthread_attr_setschedpolicy()
pthread_attr_setscope()
pthread_getschedparam()
pthread_setschedparam()
pthread_setschedprio()

This option is implemented in the LinuxThread library since glibc 2.1. NPTL does not support this option so far since there the priority protection support is not yet present.

_POSIX_PRIORITY_SCHEDULING

`_POSIX_THREAD_PROCESS_SHARED`

The synchronization primitives provided by the thread library can normally only be used among threads in the same process. They are also useful to synchronize between different processes or threads therein but this support requires help from the kernel.

pthread_barrierattr_getpshared()
pthread_barrierattr_setpshared()
pthread_condattr_getpshared()
pthread_condattr_setpshared()
pthread_mutexattr_getpshared()
pthread_mutexattr_setpshared()
pthread_rwlockattr_getpshared()
pthread_rwlockattr_setpshared()

Not part of this option (since derived not from the thread extensions to POSIX) are the POSIX semaphore functions. The sem_init() interface's second parameter allows controlling inter-process sharing. An implementation is usually linked with the implementation of this option.

The kernel support for this option wasn't available until 2.5.7. Future versions of the thread library will support this option.

Update: The Native POSIX Thread Library has support for all the functions governed by this option, including sem_init().

`_POSIX_THREAD_SAFE_FUNCTIONS`

Description:

Some of the interfaces in the POSIX standard are not thread-safe and some are changed noticeably in complexity and performance. For some but not all not thread-safe functions the POSIX standard defines variants if this option is present. The functions which were changed greatly speed-wise by the introduction of threads include the standard I/O function for which POSIX defines variants which can be implemented just like the functions in pre-thread times.

Affected interfaces:

readdir_r()
getgrgid_r()
getgrnam_r()
getpwnam_r()
getpwuid_r()
flockfile()
ftrylockfile()
funlockfile()
getc_unlocked()
getchar_unlocked()
putc_unlocked()
putchar_unlocked()
rand_r()
strerror_r()
strtok_r()
asctime_r()
ctime_r()
gmtime_r()
localtime_r()

Status:

All the mentioned functions (and more) are implemented in glibc.

This option is mandatory in IEEE 1003.1-2008.

`_POSIX_THREAD_SPORADIC_SERVER`

Just like the _POSIX_SPORADIC_SERVER option this option provides support for one more scheduler.

sched_getparam()
sched_setparam()
sched_setscheduler()

This option cannot be implemented if _POSIX_SPORADIC_SERVER cannot be implemented.