A Data Scientist is only as good as the data they have access to. Most companies store their data in variety of formats across databases and text files.
This is where Data Engineers come in — they develop the data pipelines: interfaces and mechanisms for the exchange of and access to data, often using API's. The data may or may not be transformed, and is often processed in real time (via streaming) instead of in batches.