Why does flatMap take in a function that returns stream instead of Collection?

Teddy Tsai

Why does the flatMap operation require a function which returns Stream instead of a function that returns a Collection? Any particular reason it forces the user to do the stream conversion manually?

Reading the source code example, I can see that this way the compatibility can be extended to arrays, but wouldn't an overload of flatMap achieve the same result?

// Java 8 source code example:
Stream<String> words = lines.flatMap(line -> Stream.of(line.split(" +")));

What are the use cases where it's better to make the conversion to a stream explicit?

Example: why am I forced to do this

Map<String, List<String>> map = new HashMap<String, List<String>>();
List<String> flatList = map.entrySet().stream().flatMap(e -> e.getValue().stream()).collect(Collectors.toList());

instead of this?

Map<String, List<String>> map = new HashMap<String, List<String>>();
List<String> flatList = map.entrySet().stream().flatMap(Map.Entry::getValue).collect(Collectors.toList());

Alexander Ivanchenko

Why does the flatMap() operation require a function which returns Stream instead of a function that returns a Collection?

There are many reasons for that:

First, a Stream is a means of iteration: it does not store data. Its purpose is to iterate lazily over a source of data, which can be a String, an array, an I/O stream, etc.
Second, stream operations are divided into two groups: terminal operations, which produce the result and terminate the execution of the stream pipeline (i.e. it's not possible to apply any operation after a terminal one), and intermediate operations, which transform the stream. Intermediate operations are always lazy. A stream takes elements from the source one by one and processes them lazily, i.e. operations occur only when needed. Don't confuse a stream with a chain of nested for-loops; they behave differently. Every intermediate operation produces a new stream.
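
The laziness described above can be observed with a small sketch. The class and method names here (LazyStreamDemo, visitedElements) are hypothetical, and peek() is used only to record which elements the pipeline actually pulls from the source:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;
import java.util.stream.Stream;

public class LazyStreamDemo {

    // Returns the elements that were actually pulled from the source.
    static List<String> visitedElements() {
        List<String> visited = new ArrayList<>();

        // Building the pipeline only describes the work; nothing runs yet.
        Stream<String> pipeline = Stream.of("a", "bb", "ccc", "dddd")
                .peek(visited::add)               // record each element as it is pulled
                .filter(s -> s.length() >= 2);

        if (!visited.isEmpty()) {
            throw new IllegalStateException("intermediate operations ran eagerly");
        }

        // The terminal operation drives the pipeline and stops at the first match.
        Optional<String> first = pipeline.findFirst();
        if (!first.isPresent()) {
            throw new IllegalStateException("expected a match");
        }
        return visited;
    }

    public static void main(String[] args) {
        System.out.println(visitedElements());
    }
}
```

In this sequential pipeline the source is consumed only as far as the first element that passes the filter, which is the difference from a loop that eagerly materializes every intermediate result.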

Here's a quote from the API documentation:

Streams differ from collections in several ways:

No storage. A stream is not a data structure that stores elements; instead, it conveys elements from a source such as a data structure, an array, a generator function, or an I/O channel, through a pipeline of computational operations.

Laziness-seeking. Many stream operations, such as filtering, mapping, or duplicate removal, can be implemented lazily, exposing opportunities for optimization. For example, "find the first String with three consecutive vowels" need not examine all the input strings. Stream operations are divided into intermediate (Stream-producing) operations and terminal (value- or side-effect-producing) operations. Intermediate operations are always lazy.

Since streams are internal iterators over a source of data that can be of a different nature (not necessarily a Collection), it's reasonable for flatMap() to expect data in a predictable, uniform shape: not an array, Collection, Iterable, etc., but another internal iterator, i.e. another Stream, so that it's obvious how to deal with it.

Any option that you could come up with would be less intuitive. If flatMap() were implemented so that it expected a function producing a Collection, how would you deal with strings, arrays, I/O streams, and various implementations of Iterable? Dumping the data into a Collection first is not an option. The same issue would arise if we imagine that flatMap() required an Iterable: how would we produce an Iterable from a String? Streams are designed to be versatile.
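
As a rough illustration of this point, here is a sketch in which a collection, an array and a String are each adapted to a Stream, so that flatMap() itself only ever has to deal with one shape. The class and method names (FlatMapSourcesDemo, flattenValues, words, letters) are made up for the example:

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

public class FlatMapSourcesDemo {

    // Collection source: a method reference keeps the conversion short.
    static List<String> flattenValues(Map<String, List<String>> map) {
        return map.values().stream()
                .flatMap(List::stream)
                .collect(Collectors.toList());
    }

    // Array source: split() returns String[], which Arrays.stream() adapts.
    static List<String> words(List<String> lines) {
        return lines.stream()
                .flatMap(line -> Arrays.stream(line.split(" +")))
                .collect(Collectors.toList());
    }

    // String source: chars() yields an IntStream, boxed here to Character.
    static List<Character> letters(List<String> words) {
        return words.stream()
                .flatMap(w -> w.chars().mapToObj(c -> (char) c))
                .collect(Collectors.toList());
    }
}
```

For the example in the question, iterating over map.values() and passing List::stream is about as short as the hypothetical Map.Entry::getValue version, while the same flatMap() signature also covers the array and String cases.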

I suspect that your judgement regarding flatMap() is biased because you are not accustomed to it. Once you embrace the idea that a Stream is an internal iterator, the fact that the flattening operation expects a function producing another iterator will seem more intuitive.


edited at 2022-11-18
