我注意到,当仅使用一个操作启动这堆代码时,我启动了三个作业。
\nfrom typing import List\nfrom pyspark.sql import DataFrame\nfrom pyspark.sql.types import StructType, StructField, StringType\nfrom pyspark.sql.functions import avg\n\ndata: List = [("Diamant_1A", "TopDiamant", "300", "rouge"),\n ("Diamant_2B", "Diamants pour toujours", "45", "jaune"),\n ("Diamant_3C", "Mes diamants pr\xc3\xa9f\xc3\xa9r\xc3\xa9s", "78", "rouge"),\n ("Diamant_4D", "Diamants que j\'aime", "90", "jaune"),\n ("Diamant_5E", "TopDiamant", "89", "bleu")\n ]\n\nschema: StructType = StructType([ \\\n StructField("reference", StringType(), True), \\\n StructField("marque", StringType(), True), \\\n StructField("prix", StringType(), True), \\\n StructField("couleur", StringType(), True)\n ])\n\ndataframe: DataFrame = spark.createDataFrame(data=data,schema=schema)\n\ndataframe_filtree:DataFrame = dataframe.filter("prix > 50")\n\ndataframe_filtree.show()\nRun Code Online (Sandbox Code Playgroud)\n根据我的理解,我应该只得到一个。一项操作对应一项作业。\n我正在使用 Databricks。这可能是问题所在。我有 2 个问题:
\n