mirror of
https://github.com/python/cpython.git
synced 2024-11-30 18:51:15 +01:00
437 lines
17 KiB
ReStructuredText
437 lines
17 KiB
ReStructuredText
:mod:`!multiprocessing.shared_memory` --- Shared memory for direct access across processes
|
|
==========================================================================================
|
|
|
|
.. module:: multiprocessing.shared_memory
|
|
:synopsis: Provides shared memory for direct access across processes.
|
|
|
|
**Source code:** :source:`Lib/multiprocessing/shared_memory.py`
|
|
|
|
.. versionadded:: 3.8
|
|
|
|
.. index::
|
|
single: Shared Memory
|
|
single: POSIX Shared Memory
|
|
single: Named Shared Memory
|
|
|
|
--------------
|
|
|
|
This module provides a class, :class:`SharedMemory`, for the allocation
|
|
and management of shared memory to be accessed by one or more processes
|
|
on a multicore or symmetric multiprocessor (SMP) machine. To assist with
|
|
the life-cycle management of shared memory especially across distinct
|
|
processes, a :class:`~multiprocessing.managers.BaseManager` subclass,
|
|
:class:`~multiprocessing.managers.SharedMemoryManager`, is also provided in the
|
|
:mod:`multiprocessing.managers` module.
|
|
|
|
In this module, shared memory refers to "POSIX style" shared memory blocks
|
|
(though is not necessarily implemented explicitly as such) and does not refer
|
|
to "distributed shared memory". This style of shared memory permits distinct
|
|
processes to potentially read and write to a common (or shared) region of
|
|
volatile memory. Processes are conventionally limited to only have access to
|
|
their own process memory space but shared memory permits the sharing
|
|
of data between processes, avoiding the need to instead send messages between
|
|
processes containing that data. Sharing data directly via memory can provide
|
|
significant performance benefits compared to sharing data via disk or socket
|
|
or other communications requiring the serialization/deserialization and
|
|
copying of data.
|
|
|
|
|
|
.. class:: SharedMemory(name=None, create=False, size=0, *, track=True)
|
|
|
|
Create an instance of the :class:`!SharedMemory` class for either
|
|
creating a new shared memory block or attaching to an existing shared
|
|
memory block. Each shared memory block is assigned a unique name.
|
|
In this way, one process can create a shared memory block with a
|
|
particular name and a different process can attach to that same shared
|
|
memory block using that same name.
|
|
|
|
As a resource for sharing data across processes, shared memory blocks
|
|
may outlive the original process that created them. When one process
|
|
no longer needs access to a shared memory block that might still be
|
|
needed by other processes, the :meth:`close` method should be called.
|
|
When a shared memory block is no longer needed by any process, the
|
|
:meth:`unlink` method should be called to ensure proper cleanup.
|
|
|
|
:param name:
|
|
The unique name for the requested shared memory, specified as a string.
|
|
When creating a new shared memory block, if ``None`` (the default)
|
|
is supplied for the name, a novel name will be generated.
|
|
:type name: str | None
|
|
|
|
:param bool create:
|
|
Control whether a new shared memory block is created (``True``)
|
|
or an existing shared memory block is attached (``False``).
|
|
|
|
:param int size:
|
|
The requested number of bytes when creating a new shared memory block.
|
|
Because some platforms choose to allocate chunks of memory
|
|
based upon that platform's memory page size, the exact size of the shared
|
|
memory block may be larger or equal to the size requested.
|
|
When attaching to an existing shared memory block,
|
|
the *size* parameter is ignored.
|
|
|
|
:param bool track:
|
|
When ``True``, register the shared memory block with a resource
|
|
tracker process on platforms where the OS does not do this automatically.
|
|
The resource tracker ensures proper cleanup of the shared memory even
|
|
if all other processes with access to the memory exit without doing so.
|
|
Python processes created from a common ancestor using :mod:`multiprocessing`
|
|
facilities share a single resource tracker process, and the lifetime of
|
|
shared memory segments is handled automatically among these processes.
|
|
Python processes created in any other way will receive their own
|
|
resource tracker when accessing shared memory with *track* enabled.
|
|
This will cause the shared memory to be deleted by the resource tracker
|
|
of the first process that terminates.
|
|
To avoid this issue, users of :mod:`subprocess` or standalone Python
|
|
processes should set *track* to ``False`` when there is already another
|
|
process in place that does the bookkeeping.
|
|
*track* is ignored on Windows, which has its own tracking and
|
|
automatically deletes shared memory when all handles to it have been closed.
|
|
|
|
.. versionchanged:: 3.13
|
|
Added the *track* parameter.
|
|
|
|
.. method:: close()
|
|
|
|
Close the file descriptor/handle to the shared memory from this
|
|
instance. :meth:`close` should be called once access to the shared
|
|
memory block from this instance is no longer needed. Depending
|
|
on operating system, the underlying memory may or may not be freed
|
|
even if all handles to it have been closed. To ensure proper cleanup,
|
|
use the :meth:`unlink` method.
|
|
|
|
.. method:: unlink()
|
|
|
|
Delete the underlying shared memory block. This should be called only
|
|
once per shared memory block regardless of the number of handles to it,
|
|
even in other processes.
|
|
:meth:`unlink` and :meth:`close` can be called in any order, but
|
|
trying to access data inside a shared memory block after :meth:`unlink`
|
|
may result in memory access errors, depending on platform.
|
|
|
|
This method has no effect on Windows, where the only way to delete a
|
|
shared memory block is to close all handles.
|
|
|
|
.. attribute:: buf
|
|
|
|
A memoryview of contents of the shared memory block.
|
|
|
|
.. attribute:: name
|
|
|
|
Read-only access to the unique name of the shared memory block.
|
|
|
|
.. attribute:: size
|
|
|
|
Read-only access to size in bytes of the shared memory block.
|
|
|
|
|
|
The following example demonstrates low-level use of :class:`SharedMemory`
|
|
instances::
|
|
|
|
>>> from multiprocessing import shared_memory
|
|
>>> shm_a = shared_memory.SharedMemory(create=True, size=10)
|
|
>>> type(shm_a.buf)
|
|
<class 'memoryview'>
|
|
>>> buffer = shm_a.buf
|
|
>>> len(buffer)
|
|
10
|
|
>>> buffer[:4] = bytearray([22, 33, 44, 55]) # Modify multiple at once
|
|
>>> buffer[4] = 100 # Modify single byte at a time
|
|
>>> # Attach to an existing shared memory block
|
|
>>> shm_b = shared_memory.SharedMemory(shm_a.name)
|
|
>>> import array
|
|
>>> array.array('b', shm_b.buf[:5]) # Copy the data into a new array.array
|
|
array('b', [22, 33, 44, 55, 100])
|
|
>>> shm_b.buf[:5] = b'howdy' # Modify via shm_b using bytes
|
|
>>> bytes(shm_a.buf[:5]) # Access via shm_a
|
|
b'howdy'
|
|
>>> shm_b.close() # Close each SharedMemory instance
|
|
>>> shm_a.close()
|
|
>>> shm_a.unlink() # Call unlink only once to release the shared memory
|
|
|
|
|
|
|
|
The following example demonstrates a practical use of the :class:`SharedMemory`
|
|
class with `NumPy arrays <https://numpy.org/>`_, accessing the
|
|
same :class:`!numpy.ndarray` from two distinct Python shells:
|
|
|
|
.. doctest::
|
|
:options: +SKIP
|
|
|
|
>>> # In the first Python interactive shell
|
|
>>> import numpy as np
|
|
>>> a = np.array([1, 1, 2, 3, 5, 8]) # Start with an existing NumPy array
|
|
>>> from multiprocessing import shared_memory
|
|
>>> shm = shared_memory.SharedMemory(create=True, size=a.nbytes)
|
|
>>> # Now create a NumPy array backed by shared memory
|
|
>>> b = np.ndarray(a.shape, dtype=a.dtype, buffer=shm.buf)
|
|
>>> b[:] = a[:] # Copy the original data into shared memory
|
|
>>> b
|
|
array([1, 1, 2, 3, 5, 8])
|
|
>>> type(b)
|
|
<class 'numpy.ndarray'>
|
|
>>> type(a)
|
|
<class 'numpy.ndarray'>
|
|
>>> shm.name # We did not specify a name so one was chosen for us
|
|
'psm_21467_46075'
|
|
|
|
>>> # In either the same shell or a new Python shell on the same machine
|
|
>>> import numpy as np
|
|
>>> from multiprocessing import shared_memory
|
|
>>> # Attach to the existing shared memory block
|
|
>>> existing_shm = shared_memory.SharedMemory(name='psm_21467_46075')
|
|
>>> # Note that a.shape is (6,) and a.dtype is np.int64 in this example
|
|
>>> c = np.ndarray((6,), dtype=np.int64, buffer=existing_shm.buf)
|
|
>>> c
|
|
array([1, 1, 2, 3, 5, 8])
|
|
>>> c[-1] = 888
|
|
>>> c
|
|
array([ 1, 1, 2, 3, 5, 888])
|
|
|
|
>>> # Back in the first Python interactive shell, b reflects this change
|
|
>>> b
|
|
array([ 1, 1, 2, 3, 5, 888])
|
|
|
|
>>> # Clean up from within the second Python shell
|
|
>>> del c # Unnecessary; merely emphasizing the array is no longer used
|
|
>>> existing_shm.close()
|
|
|
|
>>> # Clean up from within the first Python shell
|
|
>>> del b # Unnecessary; merely emphasizing the array is no longer used
|
|
>>> shm.close()
|
|
>>> shm.unlink() # Free and release the shared memory block at the very end
|
|
|
|
|
|
.. class:: SharedMemoryManager([address[, authkey]])
|
|
:module: multiprocessing.managers
|
|
|
|
A subclass of :class:`multiprocessing.managers.BaseManager` which can be
|
|
used for the management of shared memory blocks across processes.
|
|
|
|
A call to :meth:`~multiprocessing.managers.BaseManager.start` on a
|
|
:class:`!SharedMemoryManager` instance causes a new process to be started.
|
|
This new process's sole purpose is to manage the life cycle
|
|
of all shared memory blocks created through it. To trigger the release
|
|
of all shared memory blocks managed by that process, call
|
|
:meth:`~multiprocessing.managers.BaseManager.shutdown` on the instance.
|
|
This triggers a :meth:`~multiprocessing.shared_memory.SharedMemory.unlink` call
|
|
on all of the :class:`SharedMemory` objects managed by that process and then
|
|
stops the process itself. By creating :class:`!SharedMemory` instances
|
|
through a :class:`!SharedMemoryManager`, we avoid the need to manually track
|
|
and trigger the freeing of shared memory resources.
|
|
|
|
This class provides methods for creating and returning :class:`SharedMemory`
|
|
instances and for creating a list-like object (:class:`ShareableList`)
|
|
backed by shared memory.
|
|
|
|
Refer to :class:`~multiprocessing.managers.BaseManager` for a description
|
|
of the inherited *address* and *authkey* optional input arguments and how
|
|
they may be used to connect to an existing :class:`!SharedMemoryManager` service
|
|
from other processes.
|
|
|
|
.. method:: SharedMemory(size)
|
|
|
|
Create and return a new :class:`SharedMemory` object with the
|
|
specified *size* in bytes.
|
|
|
|
.. method:: ShareableList(sequence)
|
|
|
|
Create and return a new :class:`ShareableList` object, initialized
|
|
by the values from the input *sequence*.
|
|
|
|
|
|
The following example demonstrates the basic mechanisms of a
|
|
:class:`~multiprocessing.managers.SharedMemoryManager`:
|
|
|
|
.. doctest::
|
|
:options: +SKIP
|
|
|
|
>>> from multiprocessing.managers import SharedMemoryManager
|
|
>>> smm = SharedMemoryManager()
|
|
>>> smm.start() # Start the process that manages the shared memory blocks
|
|
>>> sl = smm.ShareableList(range(4))
|
|
>>> sl
|
|
ShareableList([0, 1, 2, 3], name='psm_6572_7512')
|
|
>>> raw_shm = smm.SharedMemory(size=128)
|
|
>>> another_sl = smm.ShareableList('alpha')
|
|
>>> another_sl
|
|
ShareableList(['a', 'l', 'p', 'h', 'a'], name='psm_6572_12221')
|
|
>>> smm.shutdown() # Calls unlink() on sl, raw_shm, and another_sl
|
|
|
|
The following example depicts a potentially more convenient pattern for using
|
|
:class:`~multiprocessing.managers.SharedMemoryManager` objects via the
|
|
:keyword:`with` statement to ensure that all shared memory blocks are released
|
|
after they are no longer needed:
|
|
|
|
.. doctest::
|
|
:options: +SKIP
|
|
|
|
>>> with SharedMemoryManager() as smm:
|
|
... sl = smm.ShareableList(range(2000))
|
|
... # Divide the work among two processes, storing partial results in sl
|
|
... p1 = Process(target=do_work, args=(sl, 0, 1000))
|
|
... p2 = Process(target=do_work, args=(sl, 1000, 2000))
|
|
... p1.start()
|
|
... p2.start() # A multiprocessing.Pool might be more efficient
|
|
... p1.join()
|
|
... p2.join() # Wait for all work to complete in both processes
|
|
... total_result = sum(sl) # Consolidate the partial results now in sl
|
|
|
|
When using a :class:`~multiprocessing.managers.SharedMemoryManager`
|
|
in a :keyword:`with` statement, the shared memory blocks created using that
|
|
manager are all released when the :keyword:`!with` statement's code block
|
|
finishes execution.
|
|
|
|
|
|
.. class:: ShareableList(sequence=None, *, name=None)
|
|
|
|
Provide a mutable list-like object where all values stored within are
|
|
stored in a shared memory block.
|
|
This constrains storable values to the following built-in data types:
|
|
|
|
* :class:`int` (signed 64-bit)
|
|
* :class:`float`
|
|
* :class:`bool`
|
|
* :class:`str` (less than 10M bytes each when encoded as UTF-8)
|
|
* :class:`bytes` (less than 10M bytes each)
|
|
* ``None``
|
|
|
|
It also notably differs from the built-in :class:`list` type
|
|
in that these lists can not change their overall length
|
|
(i.e. no :meth:`!append`, :meth:`!insert`, etc.) and do not
|
|
support the dynamic creation of new :class:`!ShareableList` instances
|
|
via slicing.
|
|
|
|
*sequence* is used in populating a new :class:`!ShareableList` full of values.
|
|
Set to ``None`` to instead attach to an already existing
|
|
:class:`!ShareableList` by its unique shared memory name.
|
|
|
|
*name* is the unique name for the requested shared memory, as described
|
|
in the definition for :class:`SharedMemory`. When attaching to an
|
|
existing :class:`!ShareableList`, specify its shared memory block's unique
|
|
name while leaving *sequence* set to ``None``.
|
|
|
|
.. note::
|
|
|
|
A known issue exists for :class:`bytes` and :class:`str` values.
|
|
If they end with ``\x00`` nul bytes or characters, those may be
|
|
*silently stripped* when fetching them by index from the
|
|
:class:`!ShareableList`. This ``.rstrip(b'\x00')`` behavior is
|
|
considered a bug and may go away in the future. See :gh:`106939`.
|
|
|
|
For applications where rstripping of trailing nulls is a problem,
|
|
work around it by always unconditionally appending an extra non-0
|
|
byte to the end of such values when storing and unconditionally
|
|
removing it when fetching:
|
|
|
|
.. doctest::
|
|
|
|
>>> from multiprocessing import shared_memory
|
|
>>> nul_bug_demo = shared_memory.ShareableList(['?\x00', b'\x03\x02\x01\x00\x00\x00'])
|
|
>>> nul_bug_demo[0]
|
|
'?'
|
|
>>> nul_bug_demo[1]
|
|
b'\x03\x02\x01'
|
|
>>> nul_bug_demo.shm.unlink()
|
|
>>> padded = shared_memory.ShareableList(['?\x00\x07', b'\x03\x02\x01\x00\x00\x00\x07'])
|
|
>>> padded[0][:-1]
|
|
'?\x00'
|
|
>>> padded[1][:-1]
|
|
b'\x03\x02\x01\x00\x00\x00'
|
|
>>> padded.shm.unlink()
|
|
|
|
.. method:: count(value)
|
|
|
|
Return the number of occurrences of *value*.
|
|
|
|
.. method:: index(value)
|
|
|
|
Return first index position of *value*.
|
|
Raise :exc:`ValueError` if *value* is not present.
|
|
|
|
.. attribute:: format
|
|
|
|
Read-only attribute containing the :mod:`struct` packing format used by
|
|
all currently stored values.
|
|
|
|
.. attribute:: shm
|
|
|
|
The :class:`SharedMemory` instance where the values are stored.
|
|
|
|
|
|
The following example demonstrates basic use of a :class:`ShareableList`
|
|
instance:
|
|
|
|
>>> from multiprocessing import shared_memory
|
|
>>> a = shared_memory.ShareableList(['howdy', b'HoWdY', -273.154, 100, None, True, 42])
|
|
>>> [ type(entry) for entry in a ]
|
|
[<class 'str'>, <class 'bytes'>, <class 'float'>, <class 'int'>, <class 'NoneType'>, <class 'bool'>, <class 'int'>]
|
|
>>> a[2]
|
|
-273.154
|
|
>>> a[2] = -78.5
|
|
>>> a[2]
|
|
-78.5
|
|
>>> a[2] = 'dry ice' # Changing data types is supported as well
|
|
>>> a[2]
|
|
'dry ice'
|
|
>>> a[2] = 'larger than previously allocated storage space'
|
|
Traceback (most recent call last):
|
|
...
|
|
ValueError: exceeds available storage for existing str
|
|
>>> a[2]
|
|
'dry ice'
|
|
>>> len(a)
|
|
7
|
|
>>> a.index(42)
|
|
6
|
|
>>> a.count(b'howdy')
|
|
0
|
|
>>> a.count(b'HoWdY')
|
|
1
|
|
>>> a.shm.close()
|
|
>>> a.shm.unlink()
|
|
>>> del a # Use of a ShareableList after call to unlink() is unsupported
|
|
|
|
The following example depicts how one, two, or many processes may access the
|
|
same :class:`ShareableList` by supplying the name of the shared memory block
|
|
behind it:
|
|
|
|
>>> b = shared_memory.ShareableList(range(5)) # In a first process
|
|
>>> c = shared_memory.ShareableList(name=b.shm.name) # In a second process
|
|
>>> c
|
|
ShareableList([0, 1, 2, 3, 4], name='...')
|
|
>>> c[-1] = -999
|
|
>>> b[-1]
|
|
-999
|
|
>>> b.shm.close()
|
|
>>> c.shm.close()
|
|
>>> c.shm.unlink()
|
|
|
|
The following examples demonstrates that :class:`ShareableList`
|
|
(and underlying :class:`SharedMemory`) objects
|
|
can be pickled and unpickled if needed.
|
|
Note, that it will still be the same shared object.
|
|
This happens, because the deserialized object has
|
|
the same unique name and is just attached to an existing
|
|
object with the same name (if the object is still alive):
|
|
|
|
>>> import pickle
|
|
>>> from multiprocessing import shared_memory
|
|
>>> sl = shared_memory.ShareableList(range(10))
|
|
>>> list(sl)
|
|
[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
|
|
|
|
>>> deserialized_sl = pickle.loads(pickle.dumps(sl))
|
|
>>> list(deserialized_sl)
|
|
[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
|
|
|
|
>>> sl[0] = -1
|
|
>>> deserialized_sl[1] = -2
|
|
>>> list(sl)
|
|
[-1, -2, 2, 3, 4, 5, 6, 7, 8, 9]
|
|
>>> list(deserialized_sl)
|
|
[-1, -2, 2, 3, 4, 5, 6, 7, 8, 9]
|
|
|
|
>>> sl.shm.close()
|
|
>>> sl.shm.unlink()
|