How to document class attributes in Python?

Question

I'm writing a lightweight class whose attributes are intended to be publicly accessible, and only sometimes overridden in specific instantiations. There's no provision in the Python language for creating docstrings for class attributes, or any sort of attributes, for that matter. What is the expected and supported way, should there be one, to document these attributes? Currently I'm doing this sort of thing:

class Albatross(object):
    """A bird with a flight speed exceeding that of an unladen swallow.

    Attributes:
    """

    flight_speed = 691
    __doc__ += """
        flight_speed (691)
          The maximum speed that such a bird can attain.
    """

    nesting_grounds = "Raymond Luxury-Yacht"
    __doc__ += """
        nesting_grounds ("Raymond Luxury-Yacht")
          The locale where these birds congregate to reproduce.
    """

    def __init__(self, **keyargs):
        """Initialize the Albatross from the keyword arguments."""
        self.__dict__.update(keyargs)

This will result in the class's docstring containing the initial standard docstring section, as well as the lines added for each attribute via augmented assignment to __doc__.

Although this style doesn't seem to be expressly forbidden in the docstring style guidelines, it's also not mentioned as an option. The advantage here is that it provides a way to document attributes alongside their definitions, while still creating a presentable class docstring, and avoiding having to write comments that reiterate the information from the docstring. I'm still kind of annoyed that I have to actually write the attributes twice; I'm considering using the string representations of the values in the docstring to at least avoid duplication of the default values.

Is this a heinous breach of the ad hoc community conventions? Is it okay? Is there a better way? For example, it's possible to create a dictionary containing values and docstrings for the attributes and then add the contents to the class __dict__ and docstring towards the end of the class declaration; this would alleviate the need to type the attribute names and values twice. edit: this last idea is, I think, not actually possible, at least not without dynamically building the entire class from data, which seems like a really bad idea unless there's some other reason to do that.

I'm pretty new to python and still working out the details of coding style, so unrelated critiques are also welcome.

If you're looking for a way to document Django model attributes, this might be helpful: djangosnippets.org/snippets/2533 — Michael Scheper, Commented Dec 3, 2014 at 5:05
Duplicate of How to document fields and properties in Python? which hold a different solution. — bufh, Commented Aug 6, 2015 at 8:25
I don't get why this is opinion based. Python specifically documents it's acceptable conventions in PEPs. There are different Python source tools that extract properly formatted documentation. In fact Python actually has an attribute doc string mentioned in PEP 257 that isn't well known and seems hard to find that may answer the OPs question, and is supported by some source tools. This is not opinion. It's fact, and part of the language, and pretty much exactly what the OP wants. — NeilG, Commented Mar 12, 2020 at 9:20

ʇsәɹoɈ · Accepted Answer · 2023-02-10 21:01:36Z

133

In short: class attributes cannot have doc strings in the way that classes and functions have.

To avoid confusion, the term property has a specific meaning in python. What you're talking about is what we call class attributes. Since they are always acted upon through their class, I find that it makes sense to document them within the class' doc string. Something like this:

class Albatross(object):
    """A bird with a flight speed exceeding that of an unladen swallow.

    Attributes:
        flight_speed     The maximum speed that such a bird can attain.
        nesting_grounds  The locale where these birds congregate to reproduce.
    """
    flight_speed = 691
    nesting_grounds = "Throatwarbler Man Grove"

I think that's a lot easier on the eyes than the approach in your example. If I really wanted a copy of the attribute values to appear in the doc string, I would put them beside or below the description of each attribute.

Keep in mind that in Python, doc strings are actual members of the objects they document, not merely source code annotations. Since class attribute variables are not objects themselves but references to objects, they have no way of holding doc strings of their own. I guess you could make a case for doc strings on references, perhaps to describe "what should go here" instead of "what is actually here", but I find it easy enough to do that in the containing class doc string.

edited Feb 10, 2023 at 21:01

answered Jun 16, 2010 at 7:25

ʇsәɹoɈ

23.2k7 gold badges56 silver badges61 bronze badges

I guess in most cases this is fine, since the attributes —thanks for the terminology correction— are succinctly enough declared that they can just be grouped at the beginning of the class declaration without making it impractical to flip back and forth to either {read both the documentation and the default value} or {update both instances of the documentation and/or default value}.
– intuited
Commented Jun 16, 2010 at 8:08
1

Also note that my example will cause the documentation for the attributes to appear in the class's docstring. I actually would prefer to put the documentation in docstrings of the attributes themselves, but this doesn't work for most builtin types.
– intuited
Commented Jun 16, 2010 at 8:22
Yes, my initial idea was to just declare e.g. flight_speed = 691; flight_speed.__doc__ = "blah blah". I think this is what you're mentioning in your edit. Unfortunately, this doesn't work for instantiations of (most?) builtin types (like int in that example). It does work for instantiations of user-defined types. =========== There was actually a PEP (sorry, forget the number) which proposed adding docstrings for class/module attributes, but it was declined because they couldn't figure out a way to make it clear whether the docstrings were for the preceding or following attributes.
– intuited
Commented Sep 8, 2010 at 10:50
2

so what if they are instance attributes? still document in the class docstring or what?
– n611x007
Commented Jan 9, 2015 at 15:38
2

@intuited Was it this PEP? legacy.python.org/dev/peps/pep-0224
– taz
Commented Feb 25, 2015 at 15:48

| Show 2 more comments

Niko Fohr · Accepted Answer · 2023-11-12 12:31:58Z

71

The other answers are very outdated. PEP-257 describes how you can use docstrings for attributes. They come after the attribute, weirdly:

String literals occurring elsewhere in Python code may also act as documentation. They are not recognized by the Python bytecode compiler and are not accessible as runtime object attributes (i.e. not assigned to __doc__), but two types of extra docstrings may be extracted by software tools:

String literals occurring immediately after a simple assignment at the top level of a module, class, or __init__ method are called “attribute docstrings”.

class C:
    "class C doc-string"

    a = 1
    "attribute C.a doc-string (1)"

    b = 2
    "attribute C.b doc-string (2)"

It also works for type annotations like this:

class C:
    "class C doc-string"

    a: int
    "attribute C.a doc-string (1)"

    b: str
    "attribute C.b doc-string (2)"

VSCode supports showing these.

edited Nov 12, 2023 at 12:31

Niko Fohr

31.9k11 gold badges104 silver badges109 bronze badges

answered Oct 28, 2021 at 13:13

Timmmm

93.6k75 gold badges394 silver badges549 bronze badges

26

PEP 224 was rejected, but this answer is still useful because numerous tools in the Python ecosystem support this way of defining attribute docstrings.
– Will Da Silva
Commented Oct 31, 2021 at 0:41
4

Oh yeah I missed that. However it is just a convention and it seems that to have widespread support in spite of Guido saying he didn't like it because the strings come after the docs (which is weird tbf). E.g. here, Sphinx supports it and Pylance supports it. It's the de facto standard.
– Timmmm
Commented Oct 31, 2021 at 16:22
33

This behaviour is also documented in PEP 257 so it is part of the standard: "String literals occurring immediately after a simple assignment at the top level of a module, class, or `__init__`` method are called "attribute docstrings"."
– phiresky
Commented Jan 7, 2022 at 13:02
5

I do not see placing the docstrings after the variables weird because docstrings in Python always come after: def, class, module file start. (I acknowledge that in other programming languages this could be different.) I would really welcome official support for them.
– pabouk - Ukraine stay strong
Commented Jul 6, 2022 at 21:50
5

@phiresky There doesn't seem to be a way to retrieve them though. help(C) doesn't describe the attribute docstrings.
– user48956
Commented Aug 15, 2023 at 20:25

| Show 4 more comments

Matthew Hegarty · Accepted Answer · 2023-05-09 15:15:11Z

53

You cite the PEP257: Docstring Conventions, in the section What is a docstring it is stated:

String literals occurring elsewhere in Python code may also act as documentation. They are not recognized by the Python bytecode compiler and are not accessible as runtime object attributes (i.e. not assigned to __doc__), but two types of extra docstrings may be extracted by software tools:

String literals occurring immediately after a simple assignment at the top level of a module, class, or __init__ method are called "attribute docstrings".

And this is explained in more details in the PEP 258: Attribute Docstrings section. As explains above, an attribute is not an object that can own a __doc__ so they won't appear in help() or pydoc. These docstrings can only be used for generated documentation.

They are used in Sphinx with the directive autoattribute.

Sphinx can use comments on a line before an assignment or a special comment following an assignment or a docstring after the definition which will be autodocumented.

edited May 9, 2023 at 15:15

Matthew Hegarty

4,1112 gold badges29 silver badges45 bronze badges

answered Mar 4, 2012 at 20:52

marcz

1,23813 silver badges12 bronze badges

1

jedi-vim plugin also recognize attribute docstrings.
– Long Vu
Commented Aug 29, 2013 at 17:00
2

I don't know when this was introduced, but Sphinx 1.2.2 seems to include attribute docstrings in the generated documentation.
– jochen
Commented Jul 19, 2014 at 12:09
5

Please note that PEP 258 is rejected. The rejection notice states: "While this may serve as an interesting design document for the now-independent docutils, it is no longer slated for inclusion in the standard library."
– Michał Łazowik
Commented Nov 14, 2018 at 11:54
1

VS Code's Pylance supports attribute docstrings since 2021.7.6 (released 2021-07): github.com/microsoft/pylance-release/issues/1576
– pabouk - Ukraine stay strong
Commented Nov 5, 2021 at 8:29

Add a comment |

gerrit · Accepted Answer · 2019-04-09 14:16:24Z

You could abuse properties to this effect. Properties contain a getter, a setter, a deleter, and a docstring. Naively, this would get very verbose:

class C:
    def __init__(self):
        self._x = None

    @property
    def x(self):
        """Docstring goes here."""
        return self._x

    @x.setter
    def x(self, value):
        self._x = value

    @x.deleter
    def x(self):
        del self._x

Then you will have a docstring belonging to C.x:

In [24]: print(C.x.__doc__)
Docstring goes here.

To do this for many attributes is cumbersome, but you could envision a helper function myprop:

def myprop(x, doc):
    def getx(self):
        return getattr(self, '_' + x)

    def setx(self, val):
        setattr(self, '_' + x, val)

    def delx(self):
        delattr(self, '_' + x)

    return property(getx, setx, delx, doc)

class C:
    a = myprop("a", "Hi, I'm A!")
    b = myprop("b", "Hi, I'm B!")

In [44]: c = C()

In [46]: c.b = 42

In [47]: c.b
Out[47]: 42

In [49]: print(C.b.__doc__)
Hi, I'm B!

Then, calling Pythons interactive help will give:

Help on class C in module __main__:

class C
 |  Data descriptors defined here:
 |  
 |  a
 |      Hi, I'm A!
 |  
 |  b
 |      Hi, I'm B!

which I think should be pretty much what you're after.

Edit: I realise now that we can perhaps avoid to need to pass the first argument to myprop at all, because the internal name doesn't matter. If subsequent calls of myprop can somehow communicate with each other, it could automatically decide upon a long and unlikely internal attribute name. I'm sure there are ways to implement this, but I'm not sure if they're worth it.

Interesting solution but unless Python does some magic under the hull creating a function and calling it just to access an attribute is unnecessary overhead. I understand that the OP is asking about documenting attributes but adding all that (especially the last one with the nested functions -_-) is way too much. — rbaleksandar, Commented Oct 1, 2021 at 10:51
@rbaleksandar You're right. I posted this more than 8 years ago, and practice shows that I never do this myself. However, I still think it shows some information about properties having docstrings, which may be of interest to some. — gerrit, Commented Oct 1, 2021 at 12:21
Note that it's not actually necessary to provide __get__ et. al., so attribute lookups don't need to do the indirection. The documentation will appear for any object that has its own __doc__; for testing types.SimpleNamespace(__doc__='hello') works but I prefer to use one that has an empty __repr__. — o11c, Commented Jun 25, 2022 at 22:39

Hùng Nguyễn · Accepted Answer · 2023-12-08 17:10:03Z

Here's an answer that abuses ast and inspect. It does nothing to the original class implementation besides changing the docstring.

The idea

Loop through all the expressions in the class body,
Check if any string expression appears right before an attribute,
Store such attribute as an attribute docstring
Create fancy formatting for the attribute docstrings and append it to the class's existing docstring.

The implementation

import ast
import inspect
from io import StringIO


def ast_find_classdef(tree):
    for e in ast.walk(tree):
        if isinstance(e, ast.ClassDef):
            return e


def attribute_docs(cls):
    """Enable attribute documentations for (data)classes. Use this function as a decorator.

    ```
    @attribute_docs
    @dataclass
    class TargetCoder:
        "Target coder for object detection"

        "Number of detection classes"
        num_classes: int

        "Normalized minimum bounding box size"
        min_size: flaot
    ```
    """
    # == find the class defining syntax tree ==
    src = inspect.getsource(cls)
    tree = ast.parse(src)
    tree = ast_find_classdef(tree)

    # == gather attribute doc strings ==
    # * We skip the first expr, because it is either a class docstring or something else
    # * The idea is that docstring appears on top of the attribute.
    # * Therefore, we search for any string node, mark that as a docstring.
    # * If a class attribute define node appears after the docstring, we store the docstring
    #    along with the class attribute's information
    attribute_docs = {}
    last_doc: Optional[str] = None
    for expr in tree.body[1:]:
        # When encouter an Expr, check if the expr a string
        if isinstance(expr, ast.Expr):
            # The value is a ast.Value node
            # therefore another access to value is needed
            value = expr.value.value
            if isinstance(value, str):
                last_doc = value.strip()

        # if the last known doc string is not none
        # and this next node is an annotation, that's a docstring
        if isinstance(expr, ast.AnnAssign) and last_doc is not None:
            # expr.target is a ast.Name
            name = ast.unparse(expr.target)
            type_name = ast.unparse(expr.annotation)
            attribute_docs[name] = (type_name, last_doc)
            last_doc = None

    # == Append to the class documentation ==
    # * if there is no attribute docstring, leave it be
    if len(attribute_docs) > 0:
        old_docs = cls.__doc__
        append_docs = build_attibute_docstrings(attribute_docs)
        cls.__doc__ = f"""{old_docs}\n\n{append_docs}"""
    return cls


def build_attibute_docstrings(docs):
    # Create pretty formatting for the attribute docs
    with StringIO() as io:
        io.write("Attributes:\n")
        for var_name, (type_name, docstring) in docs.items():
            # == Multiline vs inline doc format ==
            # * if the doc is inline, simply use the `x (type): docstring`
            # * if the doc is multiline, create a new paragraph
            if "\n" in docstring:
                lines = docstring.split("\n")
                lines = ["\t\t" + line.strip() for line in lines]
                docstring = "\n".join(lines)
                line = f"\t{var_name} ({type_name}):\n{docstring}\n"
            else:
                line = f"\t{var_name} ({type_name}): {docstring}\n"

            # Add the docstring line
            io.write(line)
        io.seek(0)
        docstring = io.read()
    return docstring

The example

@attribute_docs
@dataclass
class DBNetAlignCoder:
    "DBNet target coder for aligned case, i.e. detection targets are axis-aligned"

    "Number of detection classes"
    num_classes: int

    "Input image width"
    image_width: int

    "Input image height"
    image_height: int

    """Shrink rate of bounding boxes, the shrink distance will be computed using
    [A * (1 - r^2) / L], where A is the bounding box area, L is the bounding box
    perimeter, and r is the shrink ratio
    """
    shrink_ratio: float

    "Minimum probability to be considered a positive detection"
    det_threshold: float

    """
    Whether to use a simple threshold map drawing method. If true, the threshold
    map values will be 1, instead of the distance from shrink/expand boxes to the
    actual boxes as described in the DBNet paper.
    """
    simple_threshold: bool = False

The output of help():

class DBNetAlignCoder(builtins.object)
 |  DBNetAlignCoder(num_classes: int, image_width: int, image_height: int, shrink_ratio: float, det_threshold: float, simple_threshold: bool = False) -> None
 |  
 |  DBNet target coder for aligned case, i.e. detection targets are axis-aligned
 |  
 |  Attributes:
 |          num_classes (int): Number of detection classes
 |          image_width (int): Input image width
 |          image_height (int): Input image height
 |          shrink_ratio (float):
 |                  Shrink rate of bounding boxes, the shrink distance will be computed using
 |                  [A * (1 - r^2) / L], where A is the bounding box area, L is the bounding box
 |                  perimeter, and r is the shrink ratio
 |          det_threshold (float): Minimum probability to be considered a positive detection
 |          simple_threshold (bool):
 |                  Whether to use a simple threshold map drawing method. If true, the threshold
 |                  map values will be 1, instead of the distance from shrink/expand boxes to the
 |                  actual boxes as described in the DBNet paper.

Collectives™ on Stack Overflow

How to document class attributes in Python?

5 Answers 5

The idea

The implementation

The example

Not the answer you're looking for? Browse other questions tagged
python
class
documentation
docstring
class-attributes
or ask your own question.

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

The idea

The implementation

The example

Not the answer you're looking for? Browse other questions tagged pythonclassdocumentationdocstringclass-attributes or ask your own question.

Linked

Related

Not the answer you're looking for? Browse other questions tagged
python
class
documentation
docstring
class-attributes
or ask your own question.