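I am unable to run an AWS Glue SageMaker notebook; it throws an error on startup. Can someone tell me if I have missed something?
PS: As suggested in the error log, I restarted the kernel a few times, but it did not help.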
import sys
from awsglue.transforms import *
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job
glueContext = GlueContext(SparkContext.getOrCreate())
Error:
The code failed because of a fatal error:
Error sending http request and maximum retry encountered..
Some things to try:
a) Make sure Spark has enough available resources for Jupyter to create a Spark context.
b) Contact your Jupyter administrator to make sure the Spark magics library is …

I am trying to save a DataFrame as a Parquet file on Databricks and am getting an ArrowTypeError.
Databricks runtime version: 7.6 ML (includes Apache Spark 3.0.1, Scala 2.12)
ArrowTypeError: ('Did not pass numpy.dtype object', 'Conversion failed for column inv_yr with type int32')

Is there a way to log descriptive statistics of a dataset using MLflow? If so, could you share the details?
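For what it's worth, one possible approach is to compute the statistics yourself and log them as metrics plus a JSON artifact. The sketch below is only an illustration under assumptions: the helper names (`describe_columns`, `flatten_stats`, `log_stats_to_mlflow`), the artifact name `dataset_stats.json`, and the choice of statistics are all hypothetical, and the dataset is assumed to fit in memory as plain lists of numbers. It relies on `mlflow.log_metrics` and `mlflow.log_dict`, so it assumes an MLflow version that provides both.

```python
import statistics


def describe_columns(columns):
    """Compute basic descriptive statistics for each numeric column.

    `columns` maps a column name to a non-empty list of numbers.
    """
    stats = {}
    for name, values in columns.items():
        stats[name] = {
            "count": float(len(values)),
            "mean": statistics.fmean(values),
            # Sample standard deviation needs at least two values.
            "std": statistics.stdev(values) if len(values) > 1 else 0.0,
            "min": float(min(values)),
            "max": float(max(values)),
        }
    return stats


def flatten_stats(stats):
    """Turn {"col": {"mean": 1.0}} into {"col_mean": 1.0} for use as metric names."""
    return {
        f"{column}_{stat}": value
        for column, column_stats in stats.items()
        for stat, value in column_stats.items()
    }


def log_stats_to_mlflow(columns):
    """Log the statistics to a new MLflow run (hypothetical helper)."""
    import mlflow  # imported here so the helpers above work without MLflow installed

    stats = describe_columns(columns)
    with mlflow.start_run():
        # One metric per column/statistic pair, searchable in the MLflow UI.
        mlflow.log_metrics(flatten_stats(stats))
        # The full nested summary, stored as a JSON artifact of the run.
        mlflow.log_dict(stats, "dataset_stats.json")
```

Calling `log_stats_to_mlflow({"inv_yr": [2019, 2020, 2021]})` would then create a run with metrics such as `inv_yr_mean`. Logging metrics makes individual values comparable across runs, while the artifact keeps the whole summary in one place; whether you want both is a design choice.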