Commit Graph

286 Commits

Author SHA1 Message Date
David Wilson 5913be64d7 docs: remove last remaining major risk :D 2018-04-22 02:50:04 +01:00
David Wilson 4c5e13bf87 core: add Stream.pending_bytes() accessor. 2018-04-22 02:17:09 +01:00
David Wilson 7c88e4d013 Move _DEAD into header, autogenerate dead messages
This change blocks off 2 common scenarios where a race condition is
upgraded to a hang, when the library could internally do better.

* Since we don't know whether the receiver of a `reply_to` is expecting
  a raw or pickled message, and since in the case of a raw reply, there
  is no way to signal "dead" to the receiver, override the reply_to
  field to explicitly mark a message as dead using a special handle.

  This replaces the serialized _DEAD sentinel value with a slightly
  neater interface, in the form of the reserved IS_DEAD handle, and
  enables an important subsequent change: when a context cannot route a
  message, it can send a generic 'dead' reply back towards the message
  source, ensuring any sleeping thread is woken with ChannelError.

  The use of this field could potentially be extended later on if
  additional flags are needed, but for now this seems to suffice.

* Teach Router._invoke() to reply with a dead message when it receives a
  message for an invalid local handle.

* Teach Router._async_route() to reply with a dead message when it
  receives an unroutable message.
2018-04-22 00:17:27 +01:00
David Wilson e43c6c531b Mostly implement hybrid TTY/socket mode for sudo and SSH.
Presently there is still no mechanism to add :attr:`tty_stream` to the
multiplexer after connection is successful, but for now it's not
expected that anything will be logged to it anyway.

Closes #148.
2018-04-18 21:40:24 +01:00
David Wilson 6aad45748b docs: typo 2018-04-13 15:18:30 +01:00
David Wilson 9f94fb78c8 issue #159: make LRU size configurable. 2018-04-13 15:16:54 +01:00
David Wilson a96969ee29 docs: concision 2018-04-13 13:54:53 +01:00
David Wilson cc980569a3 issue #159: initial context LRU implementation
Now Connection.close() *must* be called in the worker, to ensure the
reference count for a context drops correctly.

Remove 'discriminator' for now, I'm not using it for testing any more
and it complicated this code.

This code is a car crash, it needs rewritten again. Ideally some/most of
this behaviour could live on services.DeduplicatingService somehow, but
I couldn't come up with a sensible design.
2018-04-13 13:49:16 +01:00
David Wilson 38c0ad1eea core: don't deregister Router handles until Broker exit.
Lots of "invalid handle: ..., 102" messages started appearing during
exit recently because ordering changed slightly, and local handles were
sent _DEAD even though the broker loop was still progressing through
shutdown.

The "shutdown" event is too early to close handles: it is the start of
the grace period where streams and downstream contexts can finish up any
work and deliver buffered data, including FORWARD_LOG messages that
haven't arrived yet.

So instead,

- move the _DEAD logic to the "exit" event,
- get rid of Context.on_shutdown() entirely, it's been unused for over
  a month,
- get rid of the "crash" event, since it always fires prior to "exit",
  and its only use was to send _DEAD to local handles, which now happens
  during exit anyway.
2018-04-11 01:51:27 +01:00
David Wilson 0e839fe63c docs: override insane list of fallback fonts
WHY ARE THE DOCS RENDERING IN ITALIC HOW LONG HAS IT BEEN LIKE THIS
ARGAHGR
2018-04-09 20:10:39 +01:00
David Wilson e9f94e1bbb docs: tidy up big list of bullets. 2018-04-09 15:12:39 +01:00
David Wilson 43ecbe350c docs: rearrange more ansible risks 2018-04-09 00:32:52 +01:00
David Wilson 19a418cc6f docs: document mitogen_task_isolation. 2018-04-09 00:10:21 +01:00
David Wilson ba74f8f743 docs: remove Ansible risk 2018-04-09 00:05:57 +01:00
David Wilson 3613162bc0 ansible: enable forking when requested and for async jobs.
Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.
2018-04-09 00:03:09 +01:00
David Wilson a643f13ebe issue #106: docs: tidyup. 2018-04-06 17:18:37 +01:00
David Wilson 432ebbca89 issue #106: docs: initial docs for how modules execute. 2018-04-06 17:09:14 +01:00
David Wilson b595314619 docs: fix intensely annoying _prefix, 2 years later. 2018-04-06 16:50:36 +01:00
David Wilson 6aeb4e9f05 issue #164: precisely emulate Ansible's stdio behaviour.
* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.
2018-04-05 01:16:34 +01:00
David Wilson e0381606af Ensure remote_tmp is respected everywhere.
Logic is still somewhat different from Ansible: we don't have to care
about sudo/non-sudo cases, etc.
2018-04-04 14:05:57 +01:00
David Wilson 38311336e1 docs: link to Ansible video demo 2018-04-03 11:13:41 +01:00
David Wilson 380ef7376d ansible: Add support for free strategy. 2018-04-02 00:01:28 +01:00
David Wilson 8425b196e7 docs: merge duplicate risks 2018-04-01 22:34:53 +01:00
David Wilson 6dcefd631a issue #106: docs: remove built-in only limitation :> 2018-04-01 22:09:56 +01:00
David Wilson 6958b8ff09 docs: More getting started. 2018-03-30 13:05:47 +05:45
David Wilson 8674ec42dd docs: add new risk 2018-03-30 01:37:17 +05:45
David Wilson bbb0f1bbd8 issue #155: fix double-fork behaviour and test it this time. 2018-03-29 23:57:24 +05:45
David Wilson 6670cba41c Introduce handler policy functions; closes #138.
Now you can specify a function to add_handler() that authenticates the
message header, with has_parent_authority() and is_immediate_child()
built in.
2018-03-29 21:40:33 +05:45
David Wilson 80a97fbc9b core: Rename Sender.put() to Sender.send().
Been annoying me for months.
2018-03-29 15:43:48 +05:45
David Wilson b48d63f33b docs: add to_sender() and update serialization notes 2018-03-29 15:38:44 +05:45
David Wilson 8676c40674 core: make _start_transmit / _stop_transmit async-only
For now at least, these APIs are always used in an asynchronous context,
so stop using the defer mechanism.
2018-03-29 15:10:43 +05:45
David Wilson c33bc22f21 docs: fix reading time tracking :( 2018-03-28 18:38:20 +05:45
David Wilson 4903052f42 fork: reseed Python/SSL PRNGs
Mac's SSL seems to have a pthread_atfork handler or similar that does
this for us, no clue if Linux is the same.
2018-03-27 11:58:17 +05:45
David Wilson 7f4368db87 docs: pad out signal handlers note 2018-03-27 11:38:48 +05:45
David Wilson d68833757b docs: slightly bikeshed last testimonial 2018-03-26 18:37:12 +05:45
dw d23a6174a2
Merge pull request #167 from danielcompton/patch-1
Add testimonial from Clojars
2018-03-26 18:35:55 +05:45
David Wilson 0dbb63ae3d docs: typo 2018-03-26 16:45:25 +05:45
David Wilson b254373cd7 docs: add local task serialization risk 2018-03-26 15:37:51 +05:45
Daniel Compton 871b853b59
Add testimonial from Clojars 2018-03-26 10:02:13 +13:00
David Wilson 2813d1a968 docs: update 2018-03-25 14:58:20 +05:45
David Wilson 5579ac936b issue #155: docs: more fork() liability warnings 2018-03-25 10:08:46 +05:45
David Wilson f457f54758 docs: fix formatting 2018-03-24 22:16:06 +05:45
David Wilson d370290687 docs: one more warning 2018-03-24 22:15:24 +05:45
David Wilson 03e08e25fd docs: put huge liability notice in fork() docs. 2018-03-24 22:13:36 +05:45
David Wilson 48351a1889 issue #155: parent: support Context.shutdown(), reap children on exit.
This permits graceful shutdown of individual contexts, without tearing
down everything.

Update mitogen.parent.Stream to also wait for the child to exit, to
prevent the buildup of zombie processes. This introduces a blocking wait
for process exit on the Broker thread, let's see if we can get away with
it. Chances are reasonable that it'll cause needless hangs on heavily
loaded machines.
2018-03-24 19:19:51 +05:45
David Wilson 6a74edce6b issue #155: parent: move master.Context into parent.
The Context and Router APIs for constructing children and making
function calls should be available in every parent context, as user code
wants to have access to the same API.
2018-03-24 19:19:45 +05:45
David Wilson 447353ecb8 docs: ansible.rst: note multi-host perf isn't great right now 2018-03-24 18:54:02 +05:45
David Wilson 20780820a6 docs: typo 2018-03-24 15:58:42 +05:45
David Wilson 75b9e1d71e issue #155: docs: document behaviour of forked children 2018-03-24 15:57:03 +05:45
David Wilson 110fdf24cd docs: add mitogen.fork.Stream to internals.rst 2018-03-24 15:53:06 +05:45
David Wilson 1a8ac9f4d1 issue #155: introduce mitogen.fork / Router.fork() 2018-03-24 15:50:28 +05:45
David Wilson 27175e3126 docs: tidy up signals.rst a little. 2018-03-23 16:52:54 +05:45
David Wilson 732a610246 docs: add beginnings of section on func decorators 2018-03-23 09:31:29 +05:45
David Wilson 2abe87472c issue #162: docs: begin documenting mitogen.service 2018-03-23 09:31:11 +05:45
David Wilson 54ff1c90fa issue #155: add DEL_ROUTE, propagate ADD_ROUTE upwards
* IDs are allocated by the parent responsible for contructing a new
  child, using ALLOCATE_ID to the master as necessary to allocate new ID
  ranges.

* ADD_ROUTE is sent up the tree rather than down. This permits
  construction of the new context to complete concurrent to parent
  contexts learning about its existence. Since all streams are strictly
  ordered, it's not possible for any parent to observe messages from the
  new context prior to arrival of an ADD_ROUTE from the parent notifying
  of its existence.

  If the new context, for example, implements an Ansible async task, its
  parent can start executing that without waiting for any synchronous
  confirmation from any parent or the master.

* Since routes propagate up, it's no longer possible for a plain
  non-parent child to ever receive ADD_ROUTE, so that code can be moved
  out of core.py and into parent.py (-0.2kb compressed).

* Add a .routes attribute to parent.Stream, and respond to disconnection
  signal on the stream by propagating DEL_ROUTE for any ADD_ROUTE ever
  received from that stream.

* Centralize route management in a new parent.RouteMonitor class
2018-03-22 11:56:24 +05:45
David Wilson aeeeb45ccb docs: farewell, glorious iframe! 2018-03-22 07:13:42 +05:45
David Wilson 1ed86774b5 issue #156: document select exception 2018-03-21 09:23:54 +05:45
David Wilson 20f5d89dfa issue #156: fix several more races
* Don't need to sleep if queue>sleepers, can just pop the right queue
  element and return it.

* If queue>sleeping and waking==sleeping, no mechanism existed to ensure
  a thread newly added to sleeping would ever be woken. Above change
  fixes that.

* Cannot trust select() return value, scheduler might sleep us
  indefinitely while put() writes a byte.

* Sleeping threads didn't pop FIFO, they popped in whatever order
  scheduler woke them up. Must recover index and use it to pick the pop
  index.
2018-03-20 14:53:19 +05:45
David Wilson 526b0a514b issue #156: prevent Latch.close() triggering spurious wakeups 2018-03-20 13:14:51 +05:45
David Wilson 18e2977baf docs: annoying phrasing 2018-03-20 13:05:41 +05:45
David Wilson 2c22c41819 issue #156: don't decrement `waking` if we timed out rather than being woken. 2018-03-20 13:02:46 +05:45
David Wilson 07a8994ff5 issue #156: waking thread result dictionary with an integer. 2018-03-20 12:55:55 +05:45
David Wilson b5398bd17f issue #156: docs typo 2018-03-20 09:12:50 +05:45
David Wilson 512ff77a46 issue #156: prevent non-sleeping threads from starving sleeping threads.
See new docs
2018-03-20 09:12:45 +05:45
David Wilson 148ce1d703 issue #155: increase context ID width to 32 bits
Needed to make large range allocations (1000 per ALLOCATE_ID roundtrip)
feasible.
2018-03-19 21:58:35 +05:45
David Wilson 071d9fbfb3 docs: tidy ansible docs. 2018-03-19 21:58:35 +05:45
David Wilson f241eac5ce parent: allow Python to determine its install prefix from argv[0]
Fixes support for virtualenv. Closes #152.
2018-03-19 21:58:35 +05:45
David Wilson dec3af375a issue #144: ansible: increase default pool size to 16. 2018-03-19 21:58:35 +05:45
David Wilson 4f361be7e7 issue #144: teach Select() to close its latch
Causes all threads sleeping on the select to wake.
2018-03-19 21:58:35 +05:45
David Wilson 8aada2646c core: support throwing LatchError in every sleeping thread
This is to allow Select() to be used as a generic queueing primitive
that supports graceful shutdown.
2018-03-19 21:58:35 +05:45
David Wilson 7a74bb0a39 docs: update ansible risks/differences. 2018-03-19 21:58:35 +05:45
David Wilson bcc15987fc docs: extra ansible paragraph. 2018-03-19 21:58:35 +05:45
David Wilson 7a394dc73e ansible: allow establishment of duplicate SSH connections 2018-03-19 21:58:35 +05:45
David Wilson 4691ce0b95 issue #150: ansible: add basic Docker support. 2018-03-19 21:58:34 +05:45
David Wilson 67ff762ba5 issue #139: docs: remove note about bad buffering 2018-03-19 21:58:34 +05:45
David Wilson d1888f1908 docs: reorder sections 2018-03-19 21:58:34 +05:45
David Wilson 014247ce66 docs: another crazy Ansible success story 2018-03-19 21:58:34 +05:45
Alex Willmer 7063d172e9 tests: Add Tox config for Python 2.6 and 2.7
I could not get Python 2.5 or earlier to work. Too many packages
(critically docker) don't support it.
2018-03-19 21:58:34 +05:45
David Wilson 88c198ea05 issue #141: copy Ansible's connect_timeout for sudo too. 2018-03-19 21:58:34 +05:45
David Wilson 63c3fc623c docs: note the semantic difference in Mitogen vs. Ansible timeouts
Related to issue #141.
2018-03-19 21:58:34 +05:45
David Wilson 8f85943083 issue #139: mention relating buffering issue 2018-03-19 21:58:34 +05:45
David Wilson 1f1d691a28 docs: update to match @moreati's code golf birdies :) 2018-03-19 21:58:34 +05:45
Alex Willmer 9aa83ef77f docs: First round of Pickle-likes survey 2018-03-19 21:58:33 +05:45
Alex Willmer a1fc21bb06 docs: Maximum size of pencode values 2018-03-19 21:58:33 +05:45
Alex Willmer e24db89f3a docs: Disco comparison 2018-03-19 21:58:33 +05:45
Alex Willmer 04f4851138 docs: multiprocessing comparison
Not strictly a rival, but has enough commonalities to be worth noting
2018-03-19 21:58:33 +05:45
Alex Willmer 8c227b2bdd docs: More detail about Baker 2018-03-19 21:58:33 +05:45
Alex Willmer e06e438228 docs: More detail about execnet 2018-03-19 21:58:33 +05:45
Alex Willmer da58f8595d docs: More detail about chopsticks 2018-03-19 21:58:33 +05:45
Alex Willmer d7fbb9aef6 docs: Link compared projects to their website
All outgoing links checked with

```bash
cd docs
make linkcheck
```
2018-03-19 21:58:33 +05:45
Alex Willmer 4615ab1a8e docs: Enable sphinx-autobuild
```bash
cd docs
make
```

to run a webserver that automatically rerenders whenever the rST is
modified.
2018-03-19 21:58:33 +05:45
David Wilson f8c85b549f docs: typo 2018-03-19 21:58:33 +05:45
David Wilson dc9dda08e1 Add link to IRC; closes #116 2018-03-19 21:58:33 +05:45
David Wilson b2a433dcc4 ssh: CompressionLevel is a v1-only option.
It's ignored by newer SSHes, which only pay attention to Compression.
2018-03-19 21:58:33 +05:45
Alex Willmer 1c20c61605 docs: Convert all URLs that support https://
Excluded: graphml XML namespaces, links to e.g. Fabric homepage

Fixes #128
2018-03-19 21:58:33 +05:45
David Wilson 089f39716e docs: beginnings of testimonials section 2018-03-19 21:58:33 +05:45
David Wilson 500c4f419c docs: update limitations list for Ansible. 2018-03-19 21:58:33 +05:45
David Wilson 537a263261 docs: So many typos 2018-03-19 21:58:33 +05:45
David Wilson 6bd4c5ead5 docs: Ansible logging update (#111) 2018-03-19 21:58:33 +05:45
David Wilson 4f352d7d4b Pin Sphinx version 2018-03-19 21:58:32 +05:45