mirror of https://github.com/python/cpython.git
718 lines
17 KiB
ReStructuredText
718 lines
17 KiB
ReStructuredText
|
|
:mod:`dis` --- Disassembler for Python bytecode
|
|
===============================================
|
|
|
|
.. module:: dis
|
|
:synopsis: Disassembler for Python bytecode.
|
|
|
|
|
|
The :mod:`dis` module supports the analysis of Python :term:`bytecode` by disassembling
|
|
it. Since there is no Python assembler, this module defines the Python assembly
|
|
language. The Python bytecode which this module takes as an input is defined
|
|
in the file :file:`Include/opcode.h` and used by the compiler and the
|
|
interpreter.
|
|
|
|
Example: Given the function :func:`myfunc`::
|
|
|
|
def myfunc(alist):
|
|
return len(alist)
|
|
|
|
the following command can be used to get the disassembly of :func:`myfunc`::
|
|
|
|
>>> dis.dis(myfunc)
|
|
2 0 LOAD_GLOBAL 0 (len)
|
|
3 LOAD_FAST 0 (alist)
|
|
6 CALL_FUNCTION 1
|
|
9 RETURN_VALUE
|
|
|
|
(The "2" is a line number).
|
|
|
|
The :mod:`dis` module defines the following functions and constants:
|
|
|
|
|
|
.. function:: dis([bytesource])
|
|
|
|
Disassemble the *bytesource* object. *bytesource* can denote either a module, a
|
|
class, a method, a function, or a code object. For a module, it disassembles
|
|
all functions. For a class, it disassembles all methods. For a single code
|
|
sequence, it prints one line per bytecode instruction. If no object is
|
|
provided, it disassembles the last traceback.
|
|
|
|
|
|
.. function:: distb([tb])
|
|
|
|
Disassembles the top-of-stack function of a traceback, using the last traceback
|
|
if none was passed. The instruction causing the exception is indicated.
|
|
|
|
|
|
.. function:: disassemble(code[, lasti])
|
|
|
|
Disassembles a code object, indicating the last instruction if *lasti* was
|
|
provided. The output is divided in the following columns:
|
|
|
|
#. the line number, for the first instruction of each line
|
|
#. the current instruction, indicated as ``-->``,
|
|
#. a labelled instruction, indicated with ``>>``,
|
|
#. the address of the instruction,
|
|
#. the operation code name,
|
|
#. operation parameters, and
|
|
#. interpretation of the parameters in parentheses.
|
|
|
|
The parameter interpretation recognizes local and global variable names,
|
|
constant values, branch targets, and compare operators.
|
|
|
|
|
|
.. function:: disco(code[, lasti])
|
|
|
|
A synonym for disassemble. It is more convenient to type, and kept for
|
|
compatibility with earlier Python releases.
|
|
|
|
|
|
.. data:: opname
|
|
|
|
Sequence of operation names, indexable using the bytecode.
|
|
|
|
|
|
.. data:: opmap
|
|
|
|
Dictionary mapping bytecodes to operation names.
|
|
|
|
|
|
.. data:: cmp_op
|
|
|
|
Sequence of all compare operation names.
|
|
|
|
|
|
.. data:: hasconst
|
|
|
|
Sequence of bytecodes that have a constant parameter.
|
|
|
|
|
|
.. data:: hasfree
|
|
|
|
Sequence of bytecodes that access a free variable.
|
|
|
|
|
|
.. data:: hasname
|
|
|
|
Sequence of bytecodes that access an attribute by name.
|
|
|
|
|
|
.. data:: hasjrel
|
|
|
|
Sequence of bytecodes that have a relative jump target.
|
|
|
|
|
|
.. data:: hasjabs
|
|
|
|
Sequence of bytecodes that have an absolute jump target.
|
|
|
|
|
|
.. data:: haslocal
|
|
|
|
Sequence of bytecodes that access a local variable.
|
|
|
|
|
|
.. data:: hascompare
|
|
|
|
Sequence of bytecodes of Boolean operations.
|
|
|
|
|
|
.. _bytecodes:
|
|
|
|
Python Bytecode Instructions
|
|
----------------------------
|
|
|
|
The Python compiler currently generates the following bytecode instructions.
|
|
|
|
|
|
.. opcode:: STOP_CODE ()
|
|
|
|
Indicates end-of-code to the compiler, not used by the interpreter.
|
|
|
|
|
|
.. opcode:: NOP ()
|
|
|
|
Do nothing code. Used as a placeholder by the bytecode optimizer.
|
|
|
|
|
|
.. opcode:: POP_TOP ()
|
|
|
|
Removes the top-of-stack (TOS) item.
|
|
|
|
|
|
.. opcode:: ROT_TWO ()
|
|
|
|
Swaps the two top-most stack items.
|
|
|
|
|
|
.. opcode:: ROT_THREE ()
|
|
|
|
Lifts second and third stack item one position up, moves top down to position
|
|
three.
|
|
|
|
|
|
.. opcode:: ROT_FOUR ()
|
|
|
|
Lifts second, third and forth stack item one position up, moves top down to
|
|
position four.
|
|
|
|
|
|
.. opcode:: DUP_TOP ()
|
|
|
|
Duplicates the reference on top of the stack.
|
|
|
|
Unary Operations take the top of the stack, apply the operation, and push the
|
|
result back on the stack.
|
|
|
|
|
|
.. opcode:: UNARY_POSITIVE ()
|
|
|
|
Implements ``TOS = +TOS``.
|
|
|
|
|
|
.. opcode:: UNARY_NEGATIVE ()
|
|
|
|
Implements ``TOS = -TOS``.
|
|
|
|
|
|
.. opcode:: UNARY_NOT ()
|
|
|
|
Implements ``TOS = not TOS``.
|
|
|
|
|
|
.. opcode:: UNARY_INVERT ()
|
|
|
|
Implements ``TOS = ~TOS``.
|
|
|
|
|
|
.. opcode:: GET_ITER ()
|
|
|
|
Implements ``TOS = iter(TOS)``.
|
|
|
|
Binary operations remove the top of the stack (TOS) and the second top-most
|
|
stack item (TOS1) from the stack. They perform the operation, and put the
|
|
result back on the stack.
|
|
|
|
|
|
.. opcode:: BINARY_POWER ()
|
|
|
|
Implements ``TOS = TOS1 ** TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_MULTIPLY ()
|
|
|
|
Implements ``TOS = TOS1 * TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_FLOOR_DIVIDE ()
|
|
|
|
Implements ``TOS = TOS1 // TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_TRUE_DIVIDE ()
|
|
|
|
Implements ``TOS = TOS1 / TOS`` when ``from __future__ import division`` is in
|
|
effect.
|
|
|
|
|
|
.. opcode:: BINARY_MODULO ()
|
|
|
|
Implements ``TOS = TOS1 % TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_ADD ()
|
|
|
|
Implements ``TOS = TOS1 + TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_SUBTRACT ()
|
|
|
|
Implements ``TOS = TOS1 - TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_SUBSCR ()
|
|
|
|
Implements ``TOS = TOS1[TOS]``.
|
|
|
|
|
|
.. opcode:: BINARY_LSHIFT ()
|
|
|
|
Implements ``TOS = TOS1 << TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_RSHIFT ()
|
|
|
|
Implements ``TOS = TOS1 >> TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_AND ()
|
|
|
|
Implements ``TOS = TOS1 & TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_XOR ()
|
|
|
|
Implements ``TOS = TOS1 ^ TOS``.
|
|
|
|
|
|
.. opcode:: BINARY_OR ()
|
|
|
|
Implements ``TOS = TOS1 | TOS``.
|
|
|
|
In-place operations are like binary operations, in that they remove TOS and
|
|
TOS1, and push the result back on the stack, but the operation is done in-place
|
|
when TOS1 supports it, and the resulting TOS may be (but does not have to be)
|
|
the original TOS1.
|
|
|
|
|
|
.. opcode:: INPLACE_POWER ()
|
|
|
|
Implements in-place ``TOS = TOS1 ** TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_MULTIPLY ()
|
|
|
|
Implements in-place ``TOS = TOS1 * TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_FLOOR_DIVIDE ()
|
|
|
|
Implements in-place ``TOS = TOS1 // TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_TRUE_DIVIDE ()
|
|
|
|
Implements in-place ``TOS = TOS1 / TOS`` when ``from __future__ import
|
|
division`` is in effect.
|
|
|
|
|
|
.. opcode:: INPLACE_MODULO ()
|
|
|
|
Implements in-place ``TOS = TOS1 % TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_ADD ()
|
|
|
|
Implements in-place ``TOS = TOS1 + TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_SUBTRACT ()
|
|
|
|
Implements in-place ``TOS = TOS1 - TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_LSHIFT ()
|
|
|
|
Implements in-place ``TOS = TOS1 << TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_RSHIFT ()
|
|
|
|
Implements in-place ``TOS = TOS1 >> TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_AND ()
|
|
|
|
Implements in-place ``TOS = TOS1 & TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_XOR ()
|
|
|
|
Implements in-place ``TOS = TOS1 ^ TOS``.
|
|
|
|
|
|
.. opcode:: INPLACE_OR ()
|
|
|
|
Implements in-place ``TOS = TOS1 | TOS``.
|
|
|
|
|
|
.. opcode:: STORE_SUBSCR ()
|
|
|
|
Implements ``TOS1[TOS] = TOS2``.
|
|
|
|
|
|
.. opcode:: DELETE_SUBSCR ()
|
|
|
|
Implements ``del TOS1[TOS]``.
|
|
|
|
Miscellaneous opcodes.
|
|
|
|
|
|
.. opcode:: PRINT_EXPR ()
|
|
|
|
Implements the expression statement for the interactive mode. TOS is removed
|
|
from the stack and printed. In non-interactive mode, an expression statement is
|
|
terminated with ``POP_STACK``.
|
|
|
|
|
|
.. opcode:: BREAK_LOOP ()
|
|
|
|
Terminates a loop due to a :keyword:`break` statement.
|
|
|
|
|
|
.. opcode:: CONTINUE_LOOP (target)
|
|
|
|
Continues a loop due to a :keyword:`continue` statement. *target* is the
|
|
address to jump to (which should be a ``FOR_ITER`` instruction).
|
|
|
|
|
|
.. opcode:: SET_ADD ()
|
|
|
|
Calls ``set.add(TOS1, TOS)``. Used to implement set comprehensions.
|
|
|
|
|
|
.. opcode:: LIST_APPEND ()
|
|
|
|
Calls ``list.append(TOS1, TOS)``. Used to implement list comprehensions.
|
|
|
|
|
|
.. opcode:: LOAD_LOCALS ()
|
|
|
|
Pushes a reference to the locals of the current scope on the stack. This is used
|
|
in the code for a class definition: After the class body is evaluated, the
|
|
locals are passed to the class definition.
|
|
|
|
|
|
.. opcode:: RETURN_VALUE ()
|
|
|
|
Returns with TOS to the caller of the function.
|
|
|
|
|
|
.. opcode:: YIELD_VALUE ()
|
|
|
|
Pops ``TOS`` and yields it from a :term:`generator`.
|
|
|
|
|
|
.. opcode:: IMPORT_STAR ()
|
|
|
|
Loads all symbols not starting with ``'_'`` directly from the module TOS to the
|
|
local namespace. The module is popped after loading all names. This opcode
|
|
implements ``from module import *``.
|
|
|
|
|
|
.. opcode:: POP_BLOCK ()
|
|
|
|
Removes one block from the block stack. Per frame, there is a stack of blocks,
|
|
denoting nested loops, try statements, and such.
|
|
|
|
|
|
.. opcode:: END_FINALLY ()
|
|
|
|
Terminates a :keyword:`finally` clause. The interpreter recalls whether the
|
|
exception has to be re-raised, or whether the function returns, and continues
|
|
with the outer-next block.
|
|
|
|
|
|
.. opcode:: BUILD_CLASS ()
|
|
|
|
Creates a new class object. TOS is the methods dictionary, TOS1 the tuple of
|
|
the names of the base classes, and TOS2 the class name.
|
|
|
|
|
|
.. opcode:: WITH_CLEANUP ()
|
|
|
|
Cleans up the stack when a :keyword:`with` statement block exits. On top of
|
|
the stack are 1--3 values indicating how/why the finally clause was entered:
|
|
|
|
* TOP = ``None``
|
|
* (TOP, SECOND) = (``WHY_{RETURN,CONTINUE}``), retval
|
|
* TOP = ``WHY_*``; no retval below it
|
|
* (TOP, SECOND, THIRD) = exc_info()
|
|
|
|
Under them is EXIT, the context manager's :meth:`__exit__` bound method.
|
|
|
|
In the last case, ``EXIT(TOP, SECOND, THIRD)`` is called, otherwise
|
|
``EXIT(None, None, None)``.
|
|
|
|
EXIT is removed from the stack, leaving the values above it in the same
|
|
order. In addition, if the stack represents an exception, *and* the function
|
|
call returns a 'true' value, this information is "zapped", to prevent
|
|
``END_FINALLY`` from re-raising the exception. (But non-local gotos should
|
|
still be resumed.)
|
|
|
|
.. XXX explain the WHY stuff!
|
|
|
|
|
|
All of the following opcodes expect arguments. An argument is two bytes, with
|
|
the more significant byte last.
|
|
|
|
.. opcode:: STORE_NAME (namei)
|
|
|
|
Implements ``name = TOS``. *namei* is the index of *name* in the attribute
|
|
:attr:`co_names` of the code object. The compiler tries to use ``STORE_FAST``
|
|
or ``STORE_GLOBAL`` if possible.
|
|
|
|
|
|
.. opcode:: DELETE_NAME (namei)
|
|
|
|
Implements ``del name``, where *namei* is the index into :attr:`co_names`
|
|
attribute of the code object.
|
|
|
|
|
|
.. opcode:: UNPACK_SEQUENCE (count)
|
|
|
|
Unpacks TOS into *count* individual values, which are put onto the stack
|
|
right-to-left.
|
|
|
|
|
|
.. opcode:: DUP_TOPX (count)
|
|
|
|
Duplicate *count* items, keeping them in the same order. Due to implementation
|
|
limits, *count* should be between 1 and 5 inclusive.
|
|
|
|
|
|
.. opcode:: STORE_ATTR (namei)
|
|
|
|
Implements ``TOS.name = TOS1``, where *namei* is the index of name in
|
|
:attr:`co_names`.
|
|
|
|
|
|
.. opcode:: DELETE_ATTR (namei)
|
|
|
|
Implements ``del TOS.name``, using *namei* as index into :attr:`co_names`.
|
|
|
|
|
|
.. opcode:: STORE_GLOBAL (namei)
|
|
|
|
Works as ``STORE_NAME``, but stores the name as a global.
|
|
|
|
|
|
.. opcode:: DELETE_GLOBAL (namei)
|
|
|
|
Works as ``DELETE_NAME``, but deletes a global name.
|
|
|
|
|
|
.. opcode:: LOAD_CONST (consti)
|
|
|
|
Pushes ``co_consts[consti]`` onto the stack.
|
|
|
|
|
|
.. opcode:: LOAD_NAME (namei)
|
|
|
|
Pushes the value associated with ``co_names[namei]`` onto the stack.
|
|
|
|
|
|
.. opcode:: BUILD_TUPLE (count)
|
|
|
|
Creates a tuple consuming *count* items from the stack, and pushes the resulting
|
|
tuple onto the stack.
|
|
|
|
|
|
.. opcode:: BUILD_LIST (count)
|
|
|
|
Works as ``BUILD_TUPLE``, but creates a list.
|
|
|
|
|
|
.. opcode:: BUILD_SET (count)
|
|
|
|
Works as ``BUILD_TUPLE``, but creates a set.
|
|
|
|
|
|
.. opcode:: BUILD_MAP (count)
|
|
|
|
Pushes a new dictionary object onto the stack. The dictionary is pre-sized
|
|
to hold *count* entries.
|
|
|
|
|
|
.. opcode:: LOAD_ATTR (namei)
|
|
|
|
Replaces TOS with ``getattr(TOS, co_names[namei])``.
|
|
|
|
|
|
.. opcode:: COMPARE_OP (opname)
|
|
|
|
Performs a Boolean operation. The operation name can be found in
|
|
``cmp_op[opname]``.
|
|
|
|
|
|
.. opcode:: IMPORT_NAME (namei)
|
|
|
|
Imports the module ``co_names[namei]``. TOS and TOS1 are popped and provide
|
|
the *fromlist* and *level* arguments of :func:`__import__`. The module
|
|
object is pushed onto the stack. The current namespace is not affected:
|
|
for a proper import statement, a subsequent ``STORE_FAST`` instruction
|
|
modifies the namespace.
|
|
|
|
|
|
.. opcode:: IMPORT_FROM (namei)
|
|
|
|
Loads the attribute ``co_names[namei]`` from the module found in TOS. The
|
|
resulting object is pushed onto the stack, to be subsequently stored by a
|
|
``STORE_FAST`` instruction.
|
|
|
|
|
|
.. opcode:: JUMP_FORWARD (delta)
|
|
|
|
Increments bytecode counter by *delta*.
|
|
|
|
|
|
.. opcode:: JUMP_IF_TRUE (delta)
|
|
|
|
If TOS is true, increment the bytecode counter by *delta*. TOS is left on the
|
|
stack.
|
|
|
|
|
|
.. opcode:: JUMP_IF_FALSE (delta)
|
|
|
|
If TOS is false, increment the bytecode counter by *delta*. TOS is not
|
|
changed.
|
|
|
|
|
|
.. opcode:: JUMP_ABSOLUTE (target)
|
|
|
|
Set bytecode counter to *target*.
|
|
|
|
|
|
.. opcode:: FOR_ITER (delta)
|
|
|
|
``TOS`` is an :term:`iterator`. Call its :meth:`__next__` method. If this
|
|
yields a new value, push it on the stack (leaving the iterator below it). If
|
|
the iterator indicates it is exhausted ``TOS`` is popped, and the byte code
|
|
counter is incremented by *delta*.
|
|
|
|
|
|
.. opcode:: LOAD_GLOBAL (namei)
|
|
|
|
Loads the global named ``co_names[namei]`` onto the stack.
|
|
|
|
|
|
.. opcode:: SETUP_LOOP (delta)
|
|
|
|
Pushes a block for a loop onto the block stack. The block spans from the
|
|
current instruction with a size of *delta* bytes.
|
|
|
|
|
|
.. opcode:: SETUP_EXCEPT (delta)
|
|
|
|
Pushes a try block from a try-except clause onto the block stack. *delta* points
|
|
to the first except block.
|
|
|
|
|
|
.. opcode:: SETUP_FINALLY (delta)
|
|
|
|
Pushes a try block from a try-except clause onto the block stack. *delta* points
|
|
to the finally block.
|
|
|
|
.. opcode:: STORE_MAP ()
|
|
|
|
Store a key and value pair in a dictionary. Pops the key and value while leaving
|
|
the dictionary on the stack.
|
|
|
|
.. opcode:: LOAD_FAST (var_num)
|
|
|
|
Pushes a reference to the local ``co_varnames[var_num]`` onto the stack.
|
|
|
|
|
|
.. opcode:: STORE_FAST (var_num)
|
|
|
|
Stores TOS into the local ``co_varnames[var_num]``.
|
|
|
|
|
|
.. opcode:: DELETE_FAST (var_num)
|
|
|
|
Deletes local ``co_varnames[var_num]``.
|
|
|
|
|
|
.. opcode:: LOAD_CLOSURE (i)
|
|
|
|
Pushes a reference to the cell contained in slot *i* of the cell and free
|
|
variable storage. The name of the variable is ``co_cellvars[i]`` if *i* is
|
|
less than the length of *co_cellvars*. Otherwise it is ``co_freevars[i -
|
|
len(co_cellvars)]``.
|
|
|
|
|
|
.. opcode:: LOAD_DEREF (i)
|
|
|
|
Loads the cell contained in slot *i* of the cell and free variable storage.
|
|
Pushes a reference to the object the cell contains on the stack.
|
|
|
|
|
|
.. opcode:: STORE_DEREF (i)
|
|
|
|
Stores TOS into the cell contained in slot *i* of the cell and free variable
|
|
storage.
|
|
|
|
|
|
.. opcode:: SET_LINENO (lineno)
|
|
|
|
This opcode is obsolete.
|
|
|
|
|
|
.. opcode:: RAISE_VARARGS (argc)
|
|
|
|
Raises an exception. *argc* indicates the number of parameters to the raise
|
|
statement, ranging from 0 to 3. The handler will find the traceback as TOS2,
|
|
the parameter as TOS1, and the exception as TOS.
|
|
|
|
|
|
.. opcode:: CALL_FUNCTION (argc)
|
|
|
|
Calls a function. The low byte of *argc* indicates the number of positional
|
|
parameters, the high byte the number of keyword parameters. On the stack, the
|
|
opcode finds the keyword parameters first. For each keyword argument, the value
|
|
is on top of the key. Below the keyword parameters, the positional parameters
|
|
are on the stack, with the right-most parameter on top. Below the parameters,
|
|
the function object to call is on the stack.
|
|
|
|
|
|
.. opcode:: MAKE_FUNCTION (argc)
|
|
|
|
Pushes a new function object on the stack. TOS is the code associated with the
|
|
function. The function object is defined to have *argc* default parameters,
|
|
which are found below TOS.
|
|
|
|
|
|
.. opcode:: MAKE_CLOSURE (argc)
|
|
|
|
Creates a new function object, sets its *__closure__* slot, and pushes it on
|
|
the stack. TOS is the code associated with the function, TOS1 the tuple
|
|
containing cells for the closure's free variables. The function also has
|
|
*argc* default parameters, which are found below the cells.
|
|
|
|
|
|
.. opcode:: BUILD_SLICE (argc)
|
|
|
|
.. index:: builtin: slice
|
|
|
|
Pushes a slice object on the stack. *argc* must be 2 or 3. If it is 2,
|
|
``slice(TOS1, TOS)`` is pushed; if it is 3, ``slice(TOS2, TOS1, TOS)`` is
|
|
pushed. See the :func:`slice` built-in function for more information.
|
|
|
|
|
|
.. opcode:: EXTENDED_ARG (ext)
|
|
|
|
Prefixes any opcode which has an argument too big to fit into the default two
|
|
bytes. *ext* holds two additional bytes which, taken together with the
|
|
subsequent opcode's argument, comprise a four-byte argument, *ext* being the two
|
|
most-significant bytes.
|
|
|
|
|
|
.. opcode:: CALL_FUNCTION_VAR (argc)
|
|
|
|
Calls a function. *argc* is interpreted as in ``CALL_FUNCTION``. The top element
|
|
on the stack contains the variable argument list, followed by keyword and
|
|
positional arguments.
|
|
|
|
|
|
.. opcode:: CALL_FUNCTION_KW (argc)
|
|
|
|
Calls a function. *argc* is interpreted as in ``CALL_FUNCTION``. The top element
|
|
on the stack contains the keyword arguments dictionary, followed by explicit
|
|
keyword and positional arguments.
|
|
|
|
|
|
.. opcode:: CALL_FUNCTION_VAR_KW (argc)
|
|
|
|
Calls a function. *argc* is interpreted as in ``CALL_FUNCTION``. The top
|
|
element on the stack contains the keyword arguments dictionary, followed by the
|
|
variable-arguments tuple, followed by explicit keyword and positional arguments.
|
|
|
|
|
|
.. opcode:: HAVE_ARGUMENT ()
|
|
|
|
This is not really an opcode. It identifies the dividing line between opcodes
|
|
which don't take arguments ``< HAVE_ARGUMENT`` and those which do ``>=
|
|
HAVE_ARGUMENT``.
|
|
|