diff --git a/README.md b/README.md new file mode 100644 index 0000000..6c6efcc --- /dev/null +++ b/README.md @@ -0,0 +1,13 @@ +# Python PySpark Training Repository + +## Installation + - [Python 3.10](https://www.python.org/downloads/) + - pyspark=3.1.1 + - [Spark 3.1.1](https://spark.apache.org/downloads.html) + - [Hadoop 3.3.6](https://hadoop.apache.org/releases.html) + - [Java JDK 11](https://www.oracle.com/java/technologies/downloads/#java11) + - (not mandatory) [Anaconda for conda](https://www.anaconda.com/download/) + +## Run Python Test + - path from src/test_pyspark_training + - `pytest -k test_` \ No newline at end of file