How does the buffer in a bash pipe work on Linux?

Fermat's Little Student Published at Java

Fermat's Little Student :

Consider a simple command like the following:

cmd1 | cmd2

Does cmd2 start to execute

1. as soon as cmd1 outputs something, or
2. only once cmd1 has completely finished and exited?

In case 1, whenever cmd1 produces output faster than cmd2 consumes it, and in case 2 always, there has to be a buffer for the intermediate output.

Where is that buffer located? Is it in memory or on disk?
Is it possible to configure the buffer's location and size?
What would happen when the buffer is not big enough?

Wyzard :

The cmd2 program starts to run immediately, but whenever it tries to read input, it'll "block" (stop and wait) if necessary until some is available. This is done automatically by the kernel. Other than that, the two programs can run concurrently (including at the same time on different CPU cores).
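The blocking read described above can be sketched in a few lines of Python. This is only an illustration under some assumptions: two threads in one process stand in for cmd1 and cmd2, and the function name `reader_blocks_until_data` and the 0.3 second delay are made up for the example. The pipe itself is a real kernel pipe of the same kind the shell creates for `|`.

```python
import os
import threading
import time


def reader_blocks_until_data(delay=0.3):
    """Return the data read and how long the read had to wait for it."""
    # Same kind of kernel pipe that the shell sets up for `cmd1 | cmd2`.
    read_fd, write_fd = os.pipe()

    def slow_writer():
        # Stand-in for cmd1: takes a while before producing any output.
        time.sleep(delay)
        os.write(write_fd, b"hello\n")
        os.close(write_fd)  # closing the write end lets the reader see EOF

    writer = threading.Thread(target=slow_writer)
    writer.start()

    # Stand-in for cmd2: it is already running, and simply waits inside
    # read() until the kernel has something to hand over.
    started = time.monotonic()
    data = os.read(read_fd, 1024)
    waited = time.monotonic() - started

    writer.join()
    os.close(read_fd)
    return data, waited


if __name__ == "__main__":
    data, waited = reader_blocks_until_data()
    print(f"got {data!r} after waiting {waited:.2f}s")
```

The reader does no polling of its own; the wait happens inside the `os.read` call, which returns as soon as the writer has put something into the pipe.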

The buffer between the two processes is held by the kernel, and it's in memory (though it might be possible for it to be paged out; I'm not sure). The default size of the buffer doesn't seem to be configurable, but a program can request a bigger size for a specific pipe (on Linux, through fcntl with F_SETPIPE_SZ), and the limit for that is configurable by writing to the /proc/sys/fs/pipe-max-size file (which, being in /proc, isn't actually a file on disk; it's a virtual file that accesses a setting in the kernel).
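As a rough sketch of how a program could inspect and enlarge the buffer of one specific pipe on Linux: the fcntl commands F_GETPIPE_SZ and F_SETPIPE_SZ are real, but the helper names below (`pipe_capacity`, `resize_pipe`, `system_pipe_max_size`) and the 128 KiB request are only illustrative. The numeric fallbacks are the Linux header values, used here on the assumption that an older Python may lack the named constants.

```python
import fcntl
import os

# Linux-specific fcntl commands. Newer Python versions expose them by name;
# the numbers are the values from the Linux headers, used as a fallback.
F_SETPIPE_SZ = getattr(fcntl, "F_SETPIPE_SZ", 1031)
F_GETPIPE_SZ = getattr(fcntl, "F_GETPIPE_SZ", 1032)


def pipe_capacity(fd):
    """Return the current buffer size, in bytes, of the pipe behind fd."""
    return fcntl.fcntl(fd, F_GETPIPE_SZ)


def resize_pipe(fd, size):
    """Ask the kernel for at least `size` bytes; return the size it set."""
    return fcntl.fcntl(fd, F_SETPIPE_SZ, size)


def system_pipe_max_size():
    """Return the limit for unprivileged resize requests, or None if the
    /proc entry is not available on this system."""
    try:
        with open("/proc/sys/fs/pipe-max-size") as f:
            return int(f.read())
    except FileNotFoundError:
        return None


if __name__ == "__main__":
    read_fd, write_fd = os.pipe()
    print("default capacity:", pipe_capacity(read_fd))
    print("after resize request:", resize_pipe(write_fd, 128 * 1024))
    print("limit from /proc:", system_pipe_max_size())
```

The kernel may round the requested size up, so the value returned by `resize_pipe` can be larger than what was asked for. A request above the limit in /proc/sys/fs/pipe-max-size fails for an unprivileged process.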

If cmd1 tries to write but the buffer is full, it will block until some space becomes available in the buffer (which happens when cmd2 reads some of the buffered data). So if cmd1 is producing output too fast, it'll automatically be slowed down by having to wait for cmd2 to consume the output.
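One way to see the "buffer is full" condition without hanging the program is to switch the write end to non-blocking mode, so that the kernel reports an error at exactly the point where a normal writer would be put to sleep. The sketch below assumes Linux; `bytes_until_full` is a made-up name and the 4096-byte chunk size is an arbitrary choice.

```python
import os


def bytes_until_full(chunk=4096):
    """Fill a pipe that nobody reads from, then show that one read frees
    enough space for one more write. Returns (bytes_written, bytes_resumed)."""
    read_fd, write_fd = os.pipe()
    # Non-blocking mode: a full buffer raises BlockingIOError instead of
    # suspending the writer the way it would for an ordinary cmd1.
    os.set_blocking(write_fd, False)

    written = 0
    try:
        while True:
            written += os.write(write_fd, b"x" * chunk)
    except BlockingIOError:
        pass  # the buffer is full; a blocking writer would be waiting here

    # The "cmd2" side consumes one chunk, which makes room again...
    os.read(read_fd, chunk)
    # ...so the next write goes through.
    resumed = os.write(write_fd, b"y" * chunk)

    os.close(read_fd)
    os.close(write_fd)
    return written, resumed


if __name__ == "__main__":
    written, resumed = bytes_until_full()
    print(f"pipe accepted {written} bytes before filling up")
    print(f"after one read, another {resumed} bytes fit")
```

The first number printed is effectively the capacity of the pipe on the machine where it runs. In an ordinary pipeline the writer is in blocking mode, so instead of an error it just waits at that point.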

If the buffer is small, the programs may end up blocking more frequently while waiting on it, which can make them take longer to finish because they'll be spending more time waiting.

In general, there are two categories that most pipelines are likely to fall into:

cmd1 produces output faster than cmd2 consumes it: the buffer is usually full (or close to it) and cmd1 often blocks when trying to write, which slows it down to match the speed of cmd2. cmd2 is able to run at full speed because input is always available in the buffer, so it rarely has to block on reading.
cmd2 consumes input faster than cmd1 produces it: the buffer is usually empty (or close to it), and cmd2 often blocks when trying to read, which slows it down to match the speed of cmd1. cmd1 is able to run at full speed because there's always space available for writing to the buffer, so it rarely has to block on writing.
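The first category (a fast producer held back by a slow consumer) can be imitated with a small experiment. This is a sketch under assumptions, not a benchmark: threads stand in for the two processes, `fast_producer_slow_consumer` is a made-up name, and the 2 MiB total and 5 ms pause are arbitrary values chosen so the effect is visible with a default-sized pipe.

```python
import os
import threading
import time


def fast_producer_slow_consumer(total=2 * 1024 * 1024, pause=0.005):
    """Push `total` bytes through a pipe whose reader pauses after each read.
    Returns (bytes_received, producer_seconds, consumer_seconds)."""
    read_fd, write_fd = os.pipe()
    timings = {}

    def producer():
        # Stand-in for cmd1: writes as fast as the kernel allows.
        start = time.monotonic()
        block = b"x" * 65536
        remaining = total
        while remaining > 0:
            # Blocks whenever the pipe buffer has no room left.
            remaining -= os.write(write_fd, block[:remaining])
        os.close(write_fd)
        timings["producer"] = time.monotonic() - start

    worker = threading.Thread(target=producer)
    worker.start()

    # Stand-in for cmd2: reads a chunk, then "processes" it slowly.
    start = time.monotonic()
    received = 0
    while True:
        data = os.read(read_fd, 65536)
        if not data:  # empty read means the write end was closed
            break
        received += len(data)
        time.sleep(pause)
    consumer_seconds = time.monotonic() - start

    worker.join()
    os.close(read_fd)
    return received, timings["producer"], consumer_seconds


if __name__ == "__main__":
    received, prod, cons = fast_producer_slow_consumer()
    print(f"received {received} bytes")
    print(f"producer took {prod:.3f}s, consumer took {cons:.3f}s")
```

Writing 2 MiB into memory would normally take almost no time, yet the producer's elapsed time ends up in the same range as the consumer's, because most of it is spent waiting for space in the buffer.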

Edited at 2020-06-06