Apache Spark: Cluster Computing with Working Sets

Gaurav Sen

55 лет назад

16,084 Просмотров

Скачать видео

Комментарии:

@WinnersDontWin - 21.12.2024 12:21

wait, what happens when results are too big for memory?

Ответить

@soumyakantarath4551 - 15.12.2024 09:57

One more point I would like to add, RDD is made of Trie data structure that makes RDD to get the lineage and evaluate. if you can find out the RDD and Data Frame architecture docs and make a video it would be great help.

Ответить

@uttamsharma9042 - 14.12.2024 14:59

Thanks

Ответить

@TheGsinghg - 14.12.2024 13:07

what makes you say that backprop will benefit by 1000x with spark? can you share the source?

Ответить

@KartikNaik18 - 14.12.2024 10:37

Hi Gaurav can you please do one on Druid

Ответить

@TJ-sv6bw - 14.12.2024 09:10

Nice one. Can you also do a deep dive on Google Dataflow and compare with Spark Streaming?

Ответить

@Moddingmonster11 - 14.12.2024 08:25

Bro if you have time please make a single video with everything covered up in your channel as a full video to system design

It will be more helpful 🙂

Ответить

@ayushbachan6113 - 14.12.2024 07:09

Hi Gaurav, can you please do apache airflow

Ответить

@shubhamjagtap108 - 14.12.2024 06:51

Thanks Gaurav. I often work with AWS EMR (EC2) along with spark-3. 4.1 in my job. We have large batch processing clusters running on a daily scheduled Airflow DAGs. Acc to me if we use MapReduce with Spark it leverages computation with processing speed at a large scale.

Ответить