小编Gau*_*wla的帖子

doc1 = {"sentence": "Today is a sunny day."}
doc2 = {"sentence": " Today is a sunny day but tomorrow it might rain"}
doc3 = {"sentence": "I know I am awesome"}
doc4 = {"sentence": "The taste of your dish is awesome"}
doc5 = {"sentence": "The taste of banana shake is good"}

# Indexing the above docs

es.index(index="english",doc_type="sentences",id=1,body=doc1)

es.index(index="english",doc_type="sentences",id=2,body=doc2)

es.index(index="english",doc_type="sentences",id=3,body=doc3)

es.index(index="english",doc_type="sentences",id=4,body=doc4)

es.index(index="english",doc_type="sentences",id=5,body=doc5)

Run Code Online (Sandbox Code Playgroud)

查询1

res = es.search(index="english",body={"from":0,"size":5,
                                  "query":
                                      {"match_phrase":
                                          {"sentence":{"query":"Today is a sunny day"}
                                          }},

                                          "explain":False})

Run Code Online (Sandbox Code Playgroud)

查询2

 res = es.search(index="english",body={"from":0,"size":5,
                                  "query":{
                                    "bool":{ …

Run Code Online (Sandbox Code Playgroud)

elasticsearch match-phrase

Gau*_*wla

2018 09-07

5
推荐指数

1
解决办法

837
查看次数

如何将输入文件读入闪亮的？

我的闪亮应用程序基于单个.csv文件的数据.所以我需要在启动时输入数据.这样,如果有人在他们的系统上打开应用程序,结果将由应用程序正确显示.我怎样才能做到这一点？

r shiny

Gau*_*wla

2014 09-18

3
推荐指数

1
解决办法

3124
查看次数

如何在Pyspark中定义空数据框并附加相应的数据帧？

所以我想从目录中读取csv文件,作为pyspark数据帧,然后将它们附加到单个数据帧中.在pyspark中没有得到替代品,就像我们在熊猫中所做的那样.

例如在Pandas,我们做:

files=glob.glob(path +'*.csv')

df=pd.DataFrame() 

for f in files:
    dff=pd.read_csv(f,delimiter=',')
    df.append(dff)

Run Code Online (Sandbox Code Playgroud)

在Pyspark,我试过这个,但没有成功

schema=StructType([])
union_df = sqlContext.createDataFrame(sc.emptyRDD(),schema)

for f in files:
    dff = sqlContext.read.load(f,format='com.databricks.spark.csv',header='true',inferSchema='true',delimiter=',')
    df=df.union_All(dff)

Run Code Online (Sandbox Code Playgroud)

非常感谢任何帮助.

谢谢

pyspark pyspark-sql

Gau*_*wla

lucky-day

3
推荐指数

4
解决办法

2万
查看次数