Added minimal code example to Channels, fix #14312 #17674

Closed

Conversation

@kshyatt (Contributor) commented Jul 28, 2016

@kshyatt added the labels "docs" (This change adds or pertains to documentation) and "parallelism" (Parallel or distributed computation) on Jul 28, 2016
@@ -492,6 +492,27 @@ and size 10. The channel exists on worker ``pid``\ .
Methods ``put!``\ , ``take!``\ , ``fetch``\ , ``isready`` and ``wait`` on a ``RemoteChannel`` are proxied onto
the backing store on the remote process.

As an example, we can construct a ``RemoteChannel`` to a worker, ``put!``\ an index inside, then ``take!``\ it out::
Contributor:

The ``\`` should only be needed if there's punctuation; spaces should be fine.

Contributor Author:
You can see I don't know RST very well.

@ViralBShah (Member):

I wonder if we should separate the distributed and parallel computing sections of this page. There are many lower-level primitives that we have for general purpose distributed computing, whereas many users of parallel computing are perhaps content with pmap, @parallel and other similar patterns that we may introduce over time.

@ViralBShah (Member):

Maybe some of this belongs to the Networking and Streams section? Sorry for creating the noise here, but this PR triggered the thought.

@mweastwood (Contributor):

I don't think this example is ideal because you are showing that the master process can put! and take! from the channels. We'd like to see the workers taking what the master process put.

Maybe something like this:

addprocs(5)
channels = Dict(worker => RemoteChannel() for worker in workers())
@everywhere take_and_print!(c) = println(take!(c))
for worker in workers()
    put!(channels[worker], worker)
    remotecall_fetch(take_and_print!, worker, channels[worker])
end

Which should output:

    From worker 2:  2
    From worker 3:  3
    From worker 4:  4
    From worker 5:  5
    From worker 6:  6

@kshyatt (Contributor, Author) commented Jul 28, 2016

That's a great point, @mweastwood. Why don't we include both, so that people can generalize from the easy case (master puts and takes) to the harder one?
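
For reference, the "easy case" described above might look something like this (a minimal sketch, not taken from the PR; the worker id 2 assumes a fresh session with a single added worker):

    addprocs(1)             # one worker, which gets pid 2 in a fresh session

    rc = RemoteChannel(2)   # the backing Channel lives on worker 2
    put!(rc, 1)             # the master process puts an index in ...
    take!(rc)               # ... and takes it back out, returning 1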

@@ -492,6 +492,46 @@ and size 10. The channel exists on worker ``pid``\ .
Methods ``put!``\ , ``take!``\ , ``fetch``\ , ``isready`` and ``wait`` on a ``RemoteChannel`` are proxied onto
the backing store on the remote process.

As an example, we can construct a ``RemoteChannel`` to a worker, ``put!`` an index inside, then ``take!`` it out:

.. doctest::
Contributor:
I think the doctest format is a little pickier than this; most of them look for julia> on the inputs, then non-prefixed output like you'd get at the REPL.
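
For illustration, a doctest-formatted version of the example might look roughly like this (a sketch only; the exact prompts, output, and use of trailing semicolons to suppress output are assumptions, not text from the PR):

    .. doctest::

        julia> rc = RemoteChannel(2);   # backing Channel lives on worker 2

        julia> put!(rc, 1);             # master puts an index in

        julia> take!(rc)                # ... and takes it back out
        1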

@amitmurthy (Contributor):

Thanks for doing this (again!).

I was thinking that we could take the same pmap code example in the previous section and rewrite it once using Channels and again using RemoteChannels.

@amitmurthy (Contributor):

As written, all the remote channels refer to channels on the master only. The workers seem incidental.

An example implementation of pmap with Channels:

    function pmap(f, lst)
        np = nprocs()

        # Create a work queue which takes in a tuple of (id, function).
        work_q = Channel{Tuple}(np)
        results_q = Channel{Tuple}(length(lst))

        # Start a task to feed the work queue
        @async begin
            for (job_id, v) in enumerate(lst)
                put!(work_q, (job_id, ()->f(v...)))
            end
            close(work_q)
        end

        @sync begin
            for p in workers()
                # start as many feeder tasks as workers
                @async begin
                    # each task runs till the work_q is open.
                    while isopen(work_q)
                        job_id, job = take!(work_q)
                        job_result = remotecall_fetch(job, p)
                        put!(results_q, (job_id, job_result))
                    end
                end
            end
        end
        close(results_q)

        return map(x->x[2], sort(collect(results_q); lt=(x,y)->x[1]<y[1]))
    end

Would like it simplified further though. I find Alan's suggestion of keeping example code blocks to no more than a single page quite relevant for easier readability.

We could then rewrite this once again using RemoteChannels.

Alternatively, we could go with an example of a simple web service that distributes computation among workers. The example would refer to the HTTPServer.jl package and distribute incoming requests among workers.
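
A rough sketch of the RemoteChannel rewrite mentioned above might look like this (not code from the PR; the name pmap_rc and the :DONE sentinel protocol are illustrative choices):

    function pmap_rc(f, lst)
        np = nworkers()

        # Both queues live on the master but are reachable from every process.
        work_q    = RemoteChannel(()->Channel{Any}(length(lst) + np))
        results_q = RemoteChannel(()->Channel{Tuple}(length(lst)))

        # Fill the work queue with (id, argument) pairs, then one :DONE per worker.
        for (job_id, v) in enumerate(lst)
            put!(work_q, (job_id, v))
        end
        for _ in 1:np
            put!(work_q, :DONE)
        end

        # One feeder task per worker: take a job, run f remotely, report back.
        @sync begin
            for p in workers()
                @async while true
                    job = take!(work_q)
                    job == :DONE && break
                    job_id, v = job
                    put!(results_q, (job_id, remotecall_fetch(f, p, v)))
                end
            end
        end

        # Collect the results and restore the original submission order.
        results = [take!(results_q) for _ in 1:length(lst)]
        return map(x->x[2], sort(results; by=x->x[1]))
    end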

@amitmurthy (Contributor):

I took another shot at a simple example for Channels.
It computes the MD5 of files in a directory using an external program md5. Concurrency will be limited to the number of cores on the machine.


    function compute_md5(dir)
        work_q = Channel(Sys.CPU_CORES)
        results_q = Channel(Sys.CPU_CORES)

        # Create a feeder task which asynchronously adds file names to a work queue
        @schedule begin
            for (root, _, files) in walkdir(dir)
                for file in files
                    put!(work_q, joinpath(root,file))
                end
            end

            # close the queue once all filenames have been added. This causes
            # worker tasks iterating over the work_q to exit. 
            close(work_q)
        end

        # Create as many worker tasks as number of cores on the machine.
        # Close the results_q once all tasks have finished.
    # Execute the entire worker task set in a separate task itself

        @schedule begin                                         # Schedule a task which starts and waits for worker tasks
            @sync begin                                         # Wait for all worker tasks to finish
                for _ in 1:Sys.CPU_CORES                        # As many worker tasks as cores
                    @async begin                                # One worker task per core
                        for file in work_q                      # Process files from channel till it is closed.
                            strm, process = open(`md5 $file`)   # Launch external program
                            wait(process)                       # Wait for external program to complete
                            put!(results_q, readstring(strm))   # collect output and write out the result
                        end
                    end
                end
            end
            close(results_q)                                    # All worker tasks have finished. Close the results channel. 
        end

    # Print out the results as they are processed.
        for md5 in results_q
            println(md5)
        end
    end

Does this work better for a beginner reading the section on Channels? Note that this does not cover RemoteChannels.

@kshyatt (Contributor, Author) commented Aug 3, 2016

I might show the full example and then go through it line by line (might be easier for a newbie to read), explaining what the more complex blocks do. Then you can say "for more information about @async, see ...".

Might be nice also to show some sample output.

I think the example is great. RemoteChannel can wait, or perhaps a better choice is to have a Jupyter notebook showing that?

@vtjnash (Member) commented Aug 3, 2016

Delete the line "wait(process)" to remove a livelock failure

@amitmurthy (Contributor):

@vtjnash how can that trigger a livelock failure? Just to understand what is happening under the hood.

@vtjnash (Member) commented Aug 3, 2016

It puts upstream pressure on the writer to prevent it from exiting since there is no active reader to consume the data
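
Concretely, the worker-task body from the md5 example would then read something like this (a sketch of the suggested fix; with the wait(process) line removed, readstring is what drains the pipe and lets the external program run to completion):

    for file in work_q                         # process files until the channel is closed
        strm, process = open(`md5 $file`)      # launch the external program
        put!(results_q, readstring(strm))      # reading to EOF consumes the output
    end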

@amitmurthy (Contributor):

The same example as above, done with RemoteChannels. We should mention that it is applicable when 1) processes are distributed across nodes and 2) the directory is on a network file system.

addprocs()                   # Starts as many workers as cores
                             # For starting workers distributed across machines use
                             # addprocs([(h, :auto) for h in [host1, host2...]])
                             # This will launch workers on host1, host2....
                             # The :auto option results in as many workers as cores on each host.

function compute_md5(dir)
    work_q = RemoteChannel(()->Channel(nworkers()))
    results_q = RemoteChannel(()->Channel(nworkers()))

    # Function that reads file names from and writes md5 values to the remote channels
    # This will be run on each worker.
    md5_func = () -> begin
        while (true)
            file = take!(work_q)

            file == :DONE && break

            strm, process = open(`md5 $file`)           # Launch external program
            put!(results_q, readstring(strm))           # collect output and write out the result
        end
    end

    # Collect the file names up front so we know how many results to expect.
    # `dir` should be on a network file system if workers are distributed across nodes.
    filenames = []
    for (root, _, files) in walkdir(dir)
        append!(filenames, [joinpath(root, file) for file in files])
    end
    nfiles = length(filenames)

    # Create a feeder task which asynchronously adds the file names to the work queue
    @schedule begin
        for file in filenames
            put!(work_q, file)
        end

        # release all the remote tasks
        for _ in 1:nworkers()
            put!(work_q, :DONE)
        end
    end

    # Launch tasks on each worker to process requests.
    for p in workers()
        remotecall(md5_func, p)
    end

    # Print out the results as they are processed.
    while nfiles > 0
        println(take!(results_q))
        nfiles -= 1
    end
end

@amitmurthy (Contributor):

A better example for RemoteChannel would be something like an HTTPServer farming out individual requests to workers, or a pipeline of image processing tasks, but as these would involve external packages we should just stick with the contrived md5 example.

@amitmurthy (Contributor):

Also consider adding the following:


The above examples show how channels can be used for inter-task communication, and remote channels for inter-process communication. They are well suited for building work pipelines or distributing work across tasks and processes.

Note that for simpler scenarios, asyncmap, which distributes work across tasks, and pmap, which distributes work across nodes, work well.

For example, md5 calculations can be performed locally with

filenames = []
for (root, _, files) in walkdir("..")
    append!(filenames, [joinpath(root, f) for f in files])
end

asyncmap(f->readstring(open(`md5 $f`)[1]), filenames)

or distributed with

pmap(f->readstring(open(`md5 $f`)[1]), filenames)

@kshyatt (Contributor, Author) commented Dec 22, 2017

The examples we have now are great! This PR has been totally superseded.
