Pipeline

Mandatory assignment

Your task is to create a system with one parent process and two child processes where the children communicate using a pipe.

Open a terminal

Open a terminal and navigate to the mandatory directory.

List directory contents (ls)

The ls shell command list directory content. Execute the ls command in the terminal.

ls

You should now see this:

Makefile     bin     obj     src

The -F option appends a slash / to directory entries.

ls -F

The only file in the directory is Makefile. The directory contains the subdirectories bin, obj and src.

Makefile     bin/     obj/     src/

Add the -1 option to:

ls -F -1

, print each entry on a separate line:

Makefile
bin/
obj/
src/

Line numbering filter (nl)

The nl shell command is a filter that reads lines from stdin and echoes each line prefixed with a line number to stdout. In the terminal, type nl and press enter.

nl

The nl command now waits for stdin input. Type Some text and press enter.

Some text
     1 Some text

Type More text and press enter.

More text
     2 More text

For each line you type, the line is echoed back with a line number. Play around some more with the nl command. Press Ctrl+D (End Of File) to make the nl command exit.

Pipeline

One of the philosophies behind Unix is the motto do one thing and do it well. In this spirit, each shell command usually have a very specific purpose. More complicated commands can be constructed by combining simpler commands in a pipeline such that the output of one command becomes the input to another command.

The standard shell syntax for pipelines is to list multiple commands, separated by vertical bars | (the pipe character). In the below example the output from the ls -F -1 command is piped to input of the nl command.

ls -F -1 | nl

Now the result of ls -F -1 is printed with line numbers added at the front of each line.

    1 Makefile
    2 bin/
    3 obj/
    4 src/

Shell commands are ordinary programs

All shell commands are ordinary programs. When the shell executes a command, the shell first forks a new process and uses exec to make the new process execute the command program.

Which programs

The which utility locates a program executable in the user’s PATH. Use which to see which program executables the shell uses for the ls and nl commands.

which ls

The executables for the ls command program is found in the /bin directory.

/bin/ls

What about the nl command?

which nl

The executables for the nl command programs is also located in the /bin directory.

/bin/nl

System overview

Your task is to create a program that mimics the ls -F -1 | nl pipeline. The parent process uses fork to create two child processes. Child A uses exec to execute the ls -F -1 command and Child B uses exec to execute the nl command. The children uses a pipe to communicate. The stdout of Child A is redirected to the write end of the pipe. The stdin of the Child B is redirected to the read end of the pipe.

Pipe

How can you make the child process share a pipe? Which process should create the pipe? When should the pipe be created?

Parent forks twice

The parent must use fork twice, once for each child process.

Parent waits twice

The parent should use wait twice to wait for both child processes to terminate.

The children uses execlp

The child process should use execlp to execute their commands.

The children uses dup2

The child process should use dup2 to redirect stdout and stdin.

  • Process A redirects stdout to write to the pipe.
  • Process B redirects stdin to read from the pipe.

Close unused pipe file descriptors

Don’t forget to close unused pipe file descriptors, otherwise a reader or writer might be blocked .

AttemptConditionsResult
ReadEmpty pipe, writers attachedReader blocked
ReadEmpty pipe, no writer attachedEnd Of File (EOF) returned
WriteFull pipe, readers attachedWriter blocked
WriteNo readers attachedSIGPIPE

Closing a read descriptor to early may cause a SIGPIPE.

Beware of SIGPIPE

The default SIGPIPE signal handler causes the process to terminate.

Check return values

You should check the return values of all system calls to detect errors. On error use perror to print an error message and then terminate with exit(EXIT_FAILURE).

pipeline.c

Use the file src/pipeline.c to implement your solution.

  • You must add code the the main function. You must add code here.
  • After fork, Child A should execute the child_a function. You must add code here.
  • After fork, Child B should execute the child_b function. You must add code here.
  • You may add your own functions to further structure your solution.

Compile and run

Use make to compile.

make

Run the program.

./bin/pipeline

When running the finished program you should see output similar to this, where $ is your shell prompt.

$ ./bin/pipeline
    1 Makefile
    2 bin/
    3 obj/
    4 src/
$

Make sure you get the shell prompt $ back.

Code grading questions

Here are a few examples of general questions not directly related to the source code that you should be able to answer and discuss during the code grading.

  • What do we mean with a process pipeline?
  • What is a pipe?
  • How are pipes used to construct process pipelines?
  • Show an example of a proccess pipeline in the terminal (shell).
  • How are shell (terminal) commands implemented?
  • What happens to the file descriptor table after a successful creation of a new pipe?

Here are a few examples of questions that you should be able to answer, discuss and relate to the source code of you solution during the code grading.

  • How many times does the parent calls fork and why?
  • Why do the children need to call execlp()?
  • Explain how each child is able to redirect stdin or stdout from or to the pipe?
  • How will the consumer know when there is no more data to expect from the pipe?
  • Why is it important for a process to close any pipe file descriptors it does not intend to use?
  • What could happen if you close a read descriptor to early?