2014-06-30 16:59:58 +00:00
|
|
|
Asynchronous and non-Blocking I/O
|
|
|
|
---------------------------------
|
2014-06-25 01:58:35 +00:00
|
|
|
|
|
|
|
Real-time web features require a long-lived mostly-idle connection per
|
|
|
|
user. In a traditional synchronous web server, this implies devoting
|
|
|
|
one thread to each user, which can be very expensive.
|
|
|
|
|
|
|
|
To minimize the cost of concurrent connections, Tornado uses a
|
|
|
|
single-threaded event loop. This means that all application code
|
|
|
|
should aim to be asynchronous and non-blocking because only one
|
|
|
|
operation can be active at a time.
|
|
|
|
|
|
|
|
The terms asynchronous and non-blocking are closely related and are
|
|
|
|
often used interchangeably, but they are not quite the same thing.
|
|
|
|
|
|
|
|
Blocking
|
|
|
|
~~~~~~~~
|
|
|
|
|
|
|
|
A function **blocks** when it waits for something to happen before
|
|
|
|
returning. A function may block for many reasons: network I/O, disk
|
|
|
|
I/O, mutexes, etc. In fact, *every* function blocks, at least a
|
|
|
|
little bit, while it is running and using the CPU (for an extreme
|
|
|
|
example that demonstrates why CPU blocking must be taken as seriously
|
|
|
|
as other kinds of blocking, consider password hashing functions like
|
|
|
|
`bcrypt <http://bcrypt.sourceforge.net/>`_, which by design use
|
|
|
|
hundreds of milliseconds of CPU time, far more than a typical network
|
|
|
|
or disk access).
|
|
|
|
|
|
|
|
A function can be blocking in some respects and non-blocking in
|
|
|
|
others. For example, `tornado.httpclient` in the default
|
|
|
|
configuration blocks on DNS resolution but not on other network access
|
|
|
|
(to mitigate this use `.ThreadedResolver` or a
|
|
|
|
``tornado.curl_httpclient`` with a properly-configured build of
|
|
|
|
``libcurl``). In the context of Tornado we generally talk about
|
|
|
|
blocking in the context of network I/O, although all kinds of blocking
|
|
|
|
are to be minimized.
|
|
|
|
|
|
|
|
Asynchronous
|
|
|
|
~~~~~~~~~~~~
|
|
|
|
|
|
|
|
An **asynchronous** function returns before it is finished, and
|
|
|
|
generally causes some work to happen in the background before
|
|
|
|
triggering some future action in the application (as opposed to normal
|
|
|
|
**synchronous** functions, which do everything they are going to do
|
|
|
|
before returning). There are many styles of asynchronous interfaces:
|
|
|
|
|
|
|
|
* Callback argument
|
|
|
|
* Return a placeholder (`.Future`, ``Promise``, ``Deferred``)
|
|
|
|
* Deliver to a queue
|
|
|
|
* Callback registry (e.g. POSIX signals)
|
|
|
|
|
|
|
|
Regardless of which type of interface is used, asynchronous functions
|
|
|
|
*by definition* interact differently with their callers; there is no
|
|
|
|
free way to make a synchronous function asynchronous in a way that is
|
|
|
|
transparent to its callers (systems like `gevent
|
|
|
|
<http://www.gevent.org>`_ use lightweight threads to offer performance
|
|
|
|
comparable to asynchronous systems, but they do not actually make
|
|
|
|
things asynchronous).
|
|
|
|
|
|
|
|
Examples
|
|
|
|
~~~~~~~~
|
|
|
|
|
|
|
|
Here is a sample synchronous function::
|
|
|
|
|
|
|
|
from tornado.httpclient import HTTPClient
|
|
|
|
|
|
|
|
def synchronous_fetch(url):
|
|
|
|
http_client = HTTPClient()
|
|
|
|
response = http_client.fetch(url)
|
|
|
|
return response.body
|
|
|
|
|
|
|
|
And here is the same function rewritten to be asynchronous with a
|
|
|
|
callback argument::
|
|
|
|
|
|
|
|
from tornado.httpclient import AsyncHTTPClient
|
|
|
|
|
|
|
|
def asynchronous_fetch(url, callback):
|
|
|
|
http_client = AsyncHTTPClient()
|
|
|
|
def handle_response(response):
|
|
|
|
callback(response.body)
|
|
|
|
http_client.fetch(url)
|
|
|
|
|
|
|
|
And again with a `.Future` instead of a callback::
|
|
|
|
|
|
|
|
from tornado.concurrent import Future
|
|
|
|
|
|
|
|
def async_fetch_future(url):
|
|
|
|
http_client = AsyncHTTPClient()
|
|
|
|
my_future = Future()
|
|
|
|
fetch_future = http_client.fetch(url)
|
|
|
|
fetch_future.add_done_callback(
|
|
|
|
lambda f: my_future.set_result(f.result()))
|
|
|
|
return my_future
|
|
|
|
|
|
|
|
The raw `.Future` version is more complex, but ``Futures`` are
|
|
|
|
nonetheless recommended practice in Tornado because they have two
|
|
|
|
major advantages. Error handling is more consistent since the
|
|
|
|
`.Future.result` method can simply raise an exception (as opposed to
|
|
|
|
the ad-hoc error handling common in callback-oriented interfaces), and
|
|
|
|
``Futures`` lend themselves well to use with coroutines. Coroutines
|
|
|
|
will be discussed in depth in the next section of this guide. Here is
|
|
|
|
the coroutine version of our sample function, which is very similar to
|
|
|
|
the original synchronous version::
|
|
|
|
|
|
|
|
from tornado import gen
|
|
|
|
|
|
|
|
@gen.coroutine
|
|
|
|
def fetch_coroutine(url):
|
|
|
|
http_client = AsyncHTTPClient()
|
|
|
|
response = yield http_client.fetch(url)
|
|
|
|
return response.body
|