Cost of len() function

Question

What is the cost of len() function for Python built-ins? (list/tuple/string/dictionary)

kcpr · Accepted Answer · 2017-02-27 15:22:24Z

505

It's O(1) (constant time, not depending of actual length of the element - very fast) on every type you've mentioned, plus set and others such as array.array.

edited Feb 27, 2017 at 15:22

kcpr

1,0891 gold badge12 silver badges29 bronze badges

answered Jul 12, 2009 at 4:40

Alex Martelli

873k174 gold badges1.2k silver badges1.4k bronze badges

3

interesting that get length runtime is only mentioned for list here - wiki.python.org/moin/TimeComplexity [not mentioned for other types]
– Chaitanya Bapat
Commented May 17, 2021 at 0:07
1

But why is it O(1)?
– Freddy Mcloughlan
Commented May 18, 2022 at 12:52
4

len() is a very frequent operation, and making it O(1) is extremely easy from the viewpoint of implementation -- Python just keeps each collection's "number of items" (length) stored and updated as part of the collection data structure.
– Alex Martelli
Commented May 18, 2022 at 17:02
I assume its only O(1) because it was already calculated at time of creation and getting len(x) is just accessing that stored value
– Kevin
Commented Jun 19, 2022 at 2:32

Add a comment |

James Thompson · Accepted Answer · 2022-04-26 01:08:26Z

166

Calling len() on those data types is O(1) in CPython, the official and most common implementation of the Python language. Here's a link to a table that provides the algorithmic complexity of many different functions in CPython:

TimeComplexity Python Wiki Page

edited Apr 26, 2022 at 1:08

user17242583

answered Jul 12, 2009 at 4:59

James Thompson

47.6k18 gold badges67 silver badges84 bronze badges

Thanks for the link. Unfortunately, it says nothing about str or tuple.
– Yaroslav Nikitenko
Commented Apr 5 at 16:57

Add a comment |

Wolf · Accepted Answer · 2015-07-22 10:15:36Z

125

All those objects keep track of their own length. The time to extract the length is small (O(1) in big-O notation) and mostly consists of [rough description, written in Python terms, not C terms]: look up "len" in a dictionary and dispatch it to the built_in len function which will look up the object's __len__ method and call that ... all it has to do is return self.length

edited Jul 22, 2015 at 10:15

Wolf

9,9557 gold badges64 silver badges110 bronze badges

answered Jul 12, 2009 at 6:17

John Machin

82.6k11 gold badges145 silver badges191 bronze badges

1

why doesn't length show up in dictionary by dir(list) ?
– ViFI
Commented Apr 26, 2020 at 2:38
@ViFI Because it is just a example. The illustrated list.lenght variable is implemented in C, not Python.
– Ekrem Dinçel
Commented Jun 16, 2020 at 15:51
For any of the built-in types, len(x) doesn't need to perform a dict lookup on CPython; the length getting function is stored in a dedicated slot in the PyTypeObject struct, that can be looked up at a constant offset, so finding that function involves: 1) Extracting the type from the object (a lookup at offset in C), 2) Looking up tp_len in the type (another lookup at offset in C), 3) Calling said function (C function call through function pointer), which 4) Looks up the actual length from the original instance (a final lookup at offset in C), then 5) Converts to Python level int.
– ShadowRanger
Commented Dec 15, 2023 at 22:16
Point is, no dict involved (aside from the global ones to find the built-in len itself, and modern CPython uses caches so it doesn't even do that most of the time); all the offsets are baked into the compiled code, so no strings are involved, let alone hashing and bucket lookups. A dict lookup is involved if you get the length by calling x.__len__(), which is why the __len__ method is slower than the len function.
– ShadowRanger
Commented Dec 15, 2023 at 22:17

Add a comment |

mechanical_meat · Accepted Answer · 2013-01-21 17:20:35Z

The below measurements provide evidence that len() is O(1) for oft-used data structures.

A note regarding timeit: When the -s flag is used and two strings are passed to timeit the first string is executed only once and is not timed.

List:

$ python -m timeit -s "l = range(10);" "len(l)"
10000000 loops, best of 3: 0.0677 usec per loop

$ python -m timeit -s "l = range(1000000);" "len(l)"
10000000 loops, best of 3: 0.0688 usec per loop

Tuple:

$ python -m timeit -s "t = (1,)*10;" "len(t)"
10000000 loops, best of 3: 0.0712 usec per loop

$ python -m timeit -s "t = (1,)*1000000;" "len(t)"
10000000 loops, best of 3: 0.0699 usec per loop

String:

$ python -m timeit -s "s = '1'*10;" "len(s)"
10000000 loops, best of 3: 0.0713 usec per loop

$ python -m timeit -s "s = '1'*1000000;" "len(s)"
10000000 loops, best of 3: 0.0686 usec per loop

Dictionary (dictionary-comprehension available in 2.7+):

$ python -mtimeit -s"d = {i:j for i,j in enumerate(range(10))};" "len(d)"
10000000 loops, best of 3: 0.0711 usec per loop

$ python -mtimeit -s"d = {i:j for i,j in enumerate(range(1000000))};" "len(d)"
10000000 loops, best of 3: 0.0727 usec per loop

Array:

$ python -mtimeit -s"import array;a=array.array('i',range(10));" "len(a)"
10000000 loops, best of 3: 0.0682 usec per loop

$ python -mtimeit -s"import array;a=array.array('i',range(1000000));" "len(a)"
10000000 loops, best of 3: 0.0753 usec per loop

Set (set-comprehension available in 2.7+):

$ python -mtimeit -s"s = {i for i in range(10)};" "len(s)"
10000000 loops, best of 3: 0.0754 usec per loop

$ python -mtimeit -s"s = {i for i in range(1000000)};" "len(s)"
10000000 loops, best of 3: 0.0713 usec per loop

Deque:

$ python -mtimeit -s"from collections import deque;d=deque(range(10));" "len(d)"
100000000 loops, best of 3: 0.0163 usec per loop

$ python -mtimeit -s"from collections import deque;d=deque(range(1000000));" "len(d)"
100000000 loops, best of 3: 0.0163 usec per loop

This is not so good of a benchmark even though it shows what we already know. This is because range(10) and range(1000000) is not supposed to be O(1). — Unknown, Commented Jul 12, 2009 at 5:45
This is by far the best answer. You should just add a conclusion just in case someone doesn't realize the constant time. — santiagobasulto, Commented Jan 21, 2013 at 13:14
Thanks for the comment. I added a note about the O(1) complexity of len(), and also fixed the measurements to properly use the -s flag. — mechanical_meat, Commented Jan 21, 2013 at 17:21
It is important to note that saving the length into a variable could save a significant amount of computational time: python -m timeit -s "l = range(10000);" "len(l); len(l); len(l)" 223 nsec per loop python -m timeit -s "l = range(100);" "len(l)" 66.2 nsec per loop — Radostin Stoyanov, Commented Jan 4, 2020 at 19:27

RAHUL KUMAR · Accepted Answer · 2018-09-02 07:02:29Z

46

len is an O(1) because in your RAM, lists are stored as tables (series of contiguous addresses). To know when the table stops the computer needs two things : length and start point. That is why len() is a O(1), the computer stores the value, so it just needs to look it up.

answered Sep 2, 2018 at 7:02

RAHUL KUMAR

1,16311 silver badges10 bronze badges

I don't think this is true for python lists. They're linked lists, not arrays, so contiguous addresses are not guaranteed
– bluppfisk
Commented Jul 5, 2023 at 6:34
3

@bluppfisk You are totally wrong. Here are the python docs docs.python.org/3/faq/…
– airled
Commented Aug 5, 2023 at 9:52

Add a comment |

progfan · Accepted Answer · 2023-09-10 21:03:55Z

It is O(1) in CPython because length is derived from the size attribute on the Pyobject representing the list. See [1], [2] and [3] in that order:

[1]:

static PyObject *
listiter_len(_PyListIterObject *it, PyObject *Py_UNUSED(ignored))
{
    Py_ssize_t len;
    if (it->it_seq) {
        len = PyList_GET_SIZE(it->it_seq) - it->it_index;
        if (len >= 0)
            return PyLong_FromSsize_t(len);
    }
    return PyLong_FromLong(0);
}

[2]:

static inline Py_ssize_t PyList_GET_SIZE(PyObject *op) {
    PyListObject *list = _PyList_CAST(op);
    return Py_SIZE(list);
}

[3]

static inline Py_ssize_t Py_SIZE(PyObject *ob) {
    assert(ob->ob_type != &PyLong_Type);
    assert(ob->ob_type != &PyBool_Type);
    PyVarObject *var_ob = _PyVarObject_CAST(ob);
    return var_ob->ob_size;
}

[1] listiter_len

[2] PyList_GET_SIZE

[3] Py_SIZE

Collectives™ on Stack Overflow

Cost of len() function

6 Answers 6

List:

Tuple:

String:

Dictionary (dictionary-comprehension available in 2.7+):

Array:

Set (set-comprehension available in 2.7+):

Deque:

Not the answer you're looking for? Browse other questions tagged
python
collections
time-complexity
or ask your own question.

Linked

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

List:

Tuple:

String:

Dictionary (dictionary-comprehension available in 2.7+):

Array:

Set (set-comprehension available in 2.7+):

Deque:

Not the answer you're looking for? Browse other questions tagged pythoncollectionstime-complexity or ask your own question.

Linked

Related

Not the answer you're looking for? Browse other questions tagged
python
collections
time-complexity
or ask your own question.