pymalloc should align to max_align_t #91335

encukou · 2022-03-31T08:59:45Z

BPO	47179
Nosy	@ronaldoussoren, @pitrou, @vstinner, @encukou, @methane

^{Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.}

Show more details

GitHub fields:

assignee = None
closed_at = None
created_at = <Date 2022-03-31.08:59:44.707>
labels = ['expert-C-API']
title = 'pymalloc should align to max_align_t'
updated_at = <Date 2022-04-07.19:36:04.980>
user = 'https://github.com/encukou'

bugs.python.org fields:

activity = <Date 2022-04-07.19:36:04.980>
actor = 'ronaldoussoren'
assignee = 'none'
closed = False
closed_date = None
closer = None
components = ['C API']
creation = <Date 2022-03-31.08:59:44.707>
creator = 'petr.viktorin'
dependencies = []
files = []
hgrepos = []
issue_num = 47179
keywords = []
message_count = 5.0
messages = ['416421', '416433', '416434', '416435', '416939']
nosy_count = 5.0
nosy_names = ['ronaldoussoren', 'pitrou', 'vstinner', 'petr.viktorin', 'methane']
pr_nums = []
priority = 'normal'
resolution = None
stage = None
status = 'open'
superseder = None
type = None
url = 'https://bugs.python.org/issue47179'
versions = []

encukou · 2022-03-31T08:59:45Z

malloc() returns memory that's "suitably aligned for any built-in type".
All of Python's allocation functions should do the same.

In bpo-27987 (PR-12850, PR-13336), the alignment was raised* to 16 bytes and long double. This is OK for current architectures, so there is no practical issue right now.

But, C11 defines a max_align_t type which sounds like the correct thing to use for determining pymalloc/PyGC_Head alignment.
At least we should assert that obmalloc's ALIGNMENT is a multiple of alignof(max_align_t), and use max_align_t rather than long double in PyGC_Head.

See also this python-cffi issue: https://foss.heptapod.net/pypy/cffi/-/issues/531#note_181779

(on 64-bit arches)

vstinner · 2022-03-31T13:39:20Z

On my x86-64 Fedora 35, gcc says 32 bytes for sizeof(max_align_t). By the way, g++ also says 32 bytes for sizeof(std::max_align_t).

GCC 11.2.1 defines max_align_t as:
---

#if (defined (__STDC_VERSION__) && __STDC_VERSION__ >= 201112L) \
  || (defined(__cplusplus) && __cplusplus >= 201103L)
#ifndef _GCC_MAX_ALIGN_T
#define _GCC_MAX_ALIGN_T
/* Type whose alignment is supported in every context and is at least
   as great as that of any standard type not using alignment
   specifiers.  */
typedef struct {
  long long __max_align_ll __attribute__((__aligned__(__alignof__(long long))));
  long double __max_align_ld __attribute__((__aligned__(__alignof__(long double))));
  /* _Float128 is defined as a basic type, so max_align_t must be
     sufficiently aligned for it.  This code must work in C++, so we
     use __float128 here; that is only available on some
     architectures, but only on i386 is extra alignment needed for
     __float128.  */
#ifdef __i386__
  __float128 __max_align_f128 __attribute__((__aligned__(__alignof(__float128))));
#endif
} max_align_t;
#endif
#endif /* C11 or C++11.  */

file: /usr/lib/gcc/x86_64-redhat-linux/11/include/stddef.h

It's not an union but a structure with 2 fields (1 long long, 1 long double). The __i386__ macro is not defined on Linux x86-64, so the struct does not have the 3rd 128-bit float field.

align.c:
---

#include <stddef.h>
int main() { return sizeof(max_align_t); }

Build and run (C):
---

$ gcc align.c -o align && (./align; echo $?)
32

align.cpp:
---

int main() { return sizeof(std::max_align_t); }

Build and run (C++):
---

$ g++ align.cpp -o align && (./align; echo $?)
32

vstinner · 2022-03-31T13:44:19Z

Objects/obmalloc.c currently relies on the SIZEOF_VOID_P macro:
---

#if SIZEOF_VOID_P > 4
#define ALIGNMENT              16               /* must be 2^N */
#define ALIGNMENT_SHIFT         4
#else
#define ALIGNMENT               8               /* must be 2^N */
#define ALIGNMENT_SHIFT         3
#endif

If we want to respect sizeof(max_align_t) alignment, we can compute sizeof(max_align_t) in configure and uses the result in obmalloc.c. I expect that it's either 16 or 32, so we can maybe just hardcode ALIGNMENT_SHIFT using something like: "if == 32 ... #elif == 16 ... #else #error ...".

On x86 (32-bit) Fedora 35, gcc says 48 for sizeof(max_align_t) which is way larger than the current alignment to 8 bytes!

vstinner · 2022-03-31T13:47:25Z

Oh, it seems like this issue is a duplicate of bpo-31912 created in 2017.

ronaldoussoren · 2022-04-07T19:36:05Z

If we want to respect sizeof(max_align_t) alignment, we can compute sizeof(max_align_t) in configure and uses the result in obmalloc.c. I expect that it's either 16 or 32, so we can maybe just hardcode ALIGNMENT_SHIFT using something like: "if == 32 ... #elif == 16 ... #else #error ...".

This should be "alignof(max_align_t)" instead of "sizeof(...)". The size itself is not relevant.

BTW, on macOS/arm64 alignof(max_align_t) is 8, not 16 (as the code seems to expect given the pointer size). This is harmless of course.

erlend-aasland · 2024-01-15T12:57:07Z

Oh, it seems like this issue is a duplicate of bpo-31912 created in 2017.

Closed #76093 as a duplicate of this issue.

encukou added topic-C-API labels Mar 31, 2022

ezio-melotti transferred this issue from another repository Apr 10, 2022

erlend-aasland mentioned this issue Jan 15, 2024

PyMem_Malloc() should guarantee alignof(max_align_t) #76093

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pymalloc should align to max_align_t #91335

pymalloc should align to max_align_t #91335

encukou commented Mar 31, 2022

encukou commented Mar 31, 2022

vstinner commented Mar 31, 2022

vstinner commented Mar 31, 2022

vstinner commented Mar 31, 2022

ronaldoussoren commented Apr 7, 2022

erlend-aasland commented Jan 15, 2024

pymalloc should align to max_align_t #91335

pymalloc should align to max_align_t #91335

Comments

encukou commented Mar 31, 2022

encukou commented Mar 31, 2022

vstinner commented Mar 31, 2022

vstinner commented Mar 31, 2022

vstinner commented Mar 31, 2022

ronaldoussoren commented Apr 7, 2022

erlend-aasland commented Jan 15, 2024