There is a newer version of Spark.
Spark is Hadoop MapReduce done in memory.
Accessing Spark 2.4.4-foss-2019a-Hadoop-2.7-Java-11-Python-3.7.2
To load the module for Spark 2.4.4-foss-2019a-Hadoop-2.7-Java-11-Python-3.7.2 please use this command on the BEAR systems (BlueBEAR, BEARCloud VMs, and CaStLeS VMs):
module load Spark/2.4.4-foss-2019a-Hadoop-2.7-Java-11-Python-3.7.2
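Once the module is loaded, a quick check from Python can confirm that the session looks as expected. The following is a minimal sketch using only the standard library. It assumes (this page does not state it) that the module places `spark-submit` on `PATH` and makes the `pyspark` package importable; the function name `check_spark_environment` is hypothetical.

```python
import importlib.util
import shutil
import sys


def check_spark_environment():
    """Report whether Spark tooling appears to be available in this session."""
    return {
        # Assumption: the loaded module adds Spark's bin directory to PATH.
        "spark_submit_on_path": shutil.which("spark-submit") is not None,
        # True if Python can locate the pyspark package (it is not imported here).
        "pyspark_importable": importlib.util.find_spec("pyspark") is not None,
        # Running interpreter version as (major, minor, micro);
        # this module depends on Python 3.7.2.
        "python_version": tuple(sys.version_info[:3]),
    }


if __name__ == "__main__":
    for name, value in check_spark_environment().items():
        print(f"{name}: {value}")
```

If either of the first two values is `False`, the module is probably not loaded in the current shell, or the assumptions above do not hold on the system in question.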
BEAR Apps Version
EL8-cascadelake — EL8-haswell — Ubuntu20.04-haswell
Each listed architecture consists of two parts: OS-CPU.
- BlueBEAR: The OS used on BlueBEAR is represented by EL, and several different processor (CPU) types are available on BlueBEAR. More information about the processor types on BlueBEAR is available on the BlueBEAR Job Submission page.
- BEAR and CaStLeS Cloud VMs: These VMs can have one of two OSes. Those with access to a BEAR Cloud or CaStLeS VM should check that the listed architectures for an application include the OS of the VM being used. The VMs, irrespective of OS, will use the haswell CPU type.
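The OS-CPU convention described above can be illustrated with a short sketch that splits each label at its first hyphen and then lists the operating systems for which a haswell build exists. The names `Architecture` and `parse_architecture` are hypothetical and are not part of any BEAR tooling.

```python
from typing import NamedTuple


class Architecture(NamedTuple):
    os: str
    cpu: str


def parse_architecture(label: str) -> Architecture:
    """Split an 'OS-CPU' label at its first hyphen."""
    os_name, sep, cpu = label.partition("-")
    if not sep or not os_name or not cpu:
        raise ValueError(f"expected 'OS-CPU', got {label!r}")
    return Architecture(os_name, cpu)


# Architectures listed on this page for this version of Spark.
listed = ["EL8-cascadelake", "EL8-haswell", "Ubuntu20.04-haswell"]
parsed = [parse_architecture(label) for label in listed]

# Operating systems with a haswell build (the CPU type used by the VMs).
haswell_oses = sorted({a.os for a in parsed if a.cpu == "haswell"})
print(haswell_oses)  # ['EL8', 'Ubuntu20.04']
```

A VM user would then check that the OS of their VM appears in that list.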
- py4j 0.10.8.1
- pyspark 2.4.4
For more information visit the Spark website.
This version of Spark has direct dependencies on: foss/2019a, Java/11, Python/3.7.2-GCCcore-8.2.0
These versions of Spark are available on the BEAR systems (BlueBEAR, BEARCloud VMs, and CaStLeS VMs). These will be retained in accordance with our Applications Support and Retention Policy.
| Version | BEAR Apps Version |
Last modified on 23rd September 2019