lightning/docs/source-app/workflows/run_work_once_content.rst

152 lines
5.4 KiB
ReStructuredText

********************************************************
What caching the calls of Work's run method does for you
********************************************************
By default, the run method in a LightningWork (Work) "remembers" (caches) the input arguments it is getting called with and does not execute again if called with the same arguments again.
In other words, the run method only executes when the input arguments have never been seen before.
You can turn caching on or off:
.. code-block:: python
# Run only when the input arguments change (default)
work = MyWork(cache_calls=True)
# Run everytime regardless of whether input arguments change or not
work = MyWork(cache_calls=False)
To better understand this, imagine that every day you want to sequentially download and process some data and then train a model on that data.
As explained in the `Event Loop guide <../glossary/event_loop.html>`_, the Lightning App runs within an infinite while loop, so the pseudo-code of your application might looks like this:
.. code-block:: python
from datetime import datetime
# Lightning code
while True: # This is the Lightning Event Loop
# Your code
today = datetime.now().strftime("%D") # '05/25/22'
data_processor.run(today)
train_model.run(data_processor.data)
In this scenario, you want your components to run ``once`` a day, and no more than that! But your code is running within an infinite loop, how can this even work?
This is where the Work's internal caching mechanism comes in. By default, Lightning caches a hash of the input provided to its run method and won't re-execute the method if the same input is provided again.
In the example above, the **data_processor** component run method receives the string **"05/25/22"**. It runs one time and any further execution during the day is skipped until tomorrow is reached and the work run method receives **06/25/22**. This logic applies everyday.
This caching mechanism is inspired from how `React.js Components and Props <https://reactjs.org/docs/components-and-props.html>`_ renders websites. Only changes to the inputs re-trigger execution.
***************
Caching Example
***************
Here's an example of this behavior with LightningWork:
.. code:: python
:emphasize-lines: 11, 17
import lightning as L
class ExampleWork(L.LightningWork):
def run(self, *args, **kwargs):
print(f"I received the following props: args: {args} kwargs: {kwargs}")
work = ExampleWork()
work.run(value=1)
# Providing the same value. This won't run as already cached.
work.run(value=1)
work.run(value=1)
work.run(value=1)
work.run(value=1)
# Changing the provided value. This isn't cached and will run again.
work.run(value=10)
And you should see the following by running the code above:
.. code-block:: console
$ python example.py
INFO: Your app has started. View it in your browser: http://127.0.0.1:7501/view
# After you have clicked `run` on the UI.
I received the following props: args: () kwargs: {'value': 1}
I received the following props: args: () kwargs: {'value': 10}
As you can see, the intermediate run didn't execute, as we would expected when ``cache_calls=True``.
***********************************
Implications of turning caching off
***********************************
By setting ``cache_calls=False``, Lightning won't cache the return value and re-execute the run method on every call.
.. code:: python
:emphasize-lines: 7
from lightning.app import LightningWork
class ExampleWork(LightningWork):
def run(self, *args, **kwargs):
print(f"I received the following props: args: {args} kwargs: {kwargs}")
work = ExampleWork(cache_calls=False)
work.run(value=1)
# Providing the same value. This won't run as already cached.
work.run(value=1)
work.run(value=1)
work.run(value=1)
work.run(value=1)
# Changing the provided value. This isn't cached and will run again.
work.run(value=10)
.. code-block:: console
$ python example.py
INFO: Your app has started. View it in your browser: http://127.0.0.1:7501/view
# After you have clicked `run` on the UI.
I received the following props: args: () kwargs: {'value': 1}
I received the following props: args: () kwargs: {'value': 1}
I received the following props: args: () kwargs: {'value': 1}
I received the following props: args: () kwargs: {'value': 1}
I received the following props: args: () kwargs: {'value': 1}
I received the following props: args: () kwargs: {'value': 10}
Be aware than when setting both ``cache_calls=False`` and ``parallel=False`` to a work, the code after the ``self.work.run()`` is unreachable
as the work continuously execute in a blocking way.
.. code-block:: python
:emphasize-lines: 9-10
from lightning.app import LightningApp, LightningFlow, LightningWork
class Flow(LightningFlow):
def __init__(self):
super().__init__()
self.work = Work(cache_calls=False, parallel=False)
def run(self):
print("HERE BEFORE")
self.work.run()
print("HERE AFTER")
app = LightningApp(Flow())
.. code-block:: console
$ lightning run app app.py
INFO: Your app has started. View it in your browser: http://127.0.0.1:7501/view
print("HERE BEFORE")
print("HERE BEFORE")
print("HERE BEFORE")
...