小编Man*_*mal的帖子

获取箱线图的数据 - 熊猫

我需要获取生成的统计数据,以便在Pandas中绘制一个箱形图(使用数据框创建箱形图).即Quartile1,Quartile2,Quartile3,较低的晶须值,较高的晶须值和异常值.我尝试了以下查询来绘制boxplot.

import pandas as pd
df = pd.DataFrame(np.random.rand(100, 5), columns=['A', 'B', 'C', 'D', 'E'])
pd.DataFrame.boxplot(df,return_type = 'both')
Run Code Online (Sandbox Code Playgroud)

有没有办法做到而不是手动计算值?

python pandas

7
推荐指数
2
解决办法
1万
查看次数

创建具有两个值的字典时错过的值

我有两个列表如下.

count = (1, 0, 0, 2, 0, 0, 1, 1, 1, 2)
bins = [[2.0, 3.0], [3.0, 4.0], [4.0, 5.0], [5.0, 6.0], [6.0, 7.0], [7.0, 8.0], [8.0, 9.0], [9.0, 10.0], [10.0, 11.0], [11.0, 12.0], [12.0]]
Run Code Online (Sandbox Code Playgroud)

我尝试使用以下创建字典;

dictionary = dict(itertools.izip(count, bins))
Run Code Online (Sandbox Code Playgroud)

它给了我 {"0": [7.0, 8.0], "1": [10.0, 11.0], "2": [11.0, 12.0]}

它只提供唯一的键值,但我需要得到所有的对,如下所示.

{"0": [3.0, 4.0],"0": [4.0, 5.0],"0": [6.0, 7.0],"0": [7.0, 8.0], "1": [2.0, 3.0],"1": [8.0, 9.0], "1": [9.0, 10.0], "1": [10.0, 11.0], "2": [6.0, 7.0] ,"2": [11.0, 12.0]}
Run Code Online (Sandbox Code Playgroud)

或者上面词典中的键和值的交换是可以接受的.(因为键应该是唯一的)我该怎么做?

python dictionary python-itertools

6
推荐指数
1
解决办法
57
查看次数

在Predictionio中训练数据时的例外情况

我正在尝试部署快速入门指南中提到的推荐引擎.我完成了构建引擎的步骤.现在我想训练推荐引擎.我在快速入门指南中提到过.(执行pio train).然后我得到了冗长的错误日志,我无法在这里粘贴.所以我把错误放在前几行.

[INFO] [Console$] Using existing engine manifest JSON at /home/PredictionIO/PredictionIO-0.9.6/bin/MyRecommendation/manifest.json
[INFO] [Runner$] Submission command: /home/PredictionIO/PredictionIO-0.9.6/vendors/spark-1.5.1-bin-hadoop2.6/bin/spark-submit --class io.prediction.workflow.CreateWorkflow --jar/PredictionIO/PredictionIO-0.9.6/bin/MyRecommendation/target/scala-2.10/template-scala-parallel-recommendation_2.10-0.1-SNAPSHOT.jar,file:/home/PredictionIO/PredictionIO-0.9.6/bndation/target/scala-2.10/template-scala-parallel-recommendation-assembly-0.1-SNAPSHOT-deps.jar --files file:/home/PredictionIO/PredictionIO-0.9.6/conf/log4j.properties --driver/home/PredictionIO/PredictionIO-0.9.6/conf:/home/PredictionIO/PredictionIO-0.9.6/lib/postgresql-9.4-1204.jdbc41.jar:/home/PredictionIO/PredictionIO-0.9.6/lib/mysql-connector-jav file:/home/PredictionIO/PredictionIO-0.9.6/lib/pio-assembly-0.9.6.jar --engine-id qokYFr4rwibijNjabXeVSQKKFrACyrYZ --engine-version ed29b3e2074149d483aa85b6b1ea35a52dbbdb9a --et file:/home/PredictionIO/PredictionIO-0.9.6/bin/MyRecommendation/engine.json --verbosity 0 --json-extractor Both --env PIO_ENV_LOADED=1,PIO_STORAGE_REPOSITORIES_METADATA_NAME=pFS_BASEDIR=/root/.pio_store,PIO_HOME=/home/PredictionIO/PredictionIO-0.9.6,PIO_FS_ENGINESDIR=/root/.pio_store/engines,PIO_STORAGE_SOURCES_PGSQL_URL=jdbc:postgresql://localhost/pGE_REPOSITORIES_METADATA_SOURCE=PGSQL,PIO_STORAGE_REPOSITORIES_MODELDATA_SOURCE=PGSQL,PIO_STORAGE_REPOSITORIES_EVENTDATA_NAME=pio_event,PIO_STORAGE_SOURCES_PGSQL_PASSWORD=pio,PIURCES_PGSQL_TYPE=jdbc,PIO_FS_TMPDIR=/root/.pio_store/tmp,PIO_STORAGE_SOURCES_PGSQL_USERNAME=pio,PIO_STORAGE_REPOSITORIES_MODELDATA_NAME=pio_model,PIO_STORAGE_REPOSITORIES_EVENTDGSQL,PIO_CONF_DIR=/home/PredictionIO/PredictionIO-0.9.6/conf
[INFO] [Engine] Extracting datasource params...
[INFO] [WorkflowUtils$] No 'name' is found. Default empty String will be used.
[INFO] [Engine] Datasource params: (,DataSourceParams(MyApp3,None))
[INFO] [Engine] Extracting preparator params...
[INFO] [Engine] Preparator params: (,Empty)
[INFO] [Engine] Extracting serving params...
[INFO] [Engine] Serving params: …
Run Code Online (Sandbox Code Playgroud)

python recommendation-engine apache-spark predictionio data-science

6
推荐指数
1
解决办法
733
查看次数

将深层嵌套的json从facebook转换为python中的dataframe

我正在尝试获取有关Facebook帖子的评论和评论的用户详细信息.我正在使用python facebook-sdk包.代码如下.

import facebook as fi
import json

graph = fi.GraphAPI('Access Token')
data = json.dumps(graph.get_object('DSIfootcandy/posts'))
Run Code Online (Sandbox Code Playgroud)

从上面,我得到一个高度嵌套的json.在这里,我将只为fb中的一个帖子添加一个json字符串.

{
  "paging": {
    "next": "https://graph.facebook.com/v2.0/425073257683630/posts?access_token=&limit=25&until=1449201121&__paging_token=enc_AdD0DL6sN3aDZCwfYY25rJLW9IZBZCLM1QfX0venal6rpjUNvAWZBOoxTjbOYZAaFiBImzMqiv149HPH5FBJFo0nSVOPqUy78S0YvwZDZD",
    "previous": "https://graph.facebook.com/v2.0/425073257683630/posts?since=1450843741&access_token=&limit=25&__paging_token=enc_AdCYobFJpcNavx6STzfPFyFe6eQQxRhkObwl2EdulwL7mjbnIETve7sJZCPMwVm7lu7yZA5FoY5Q4sprlQezF4AlGfZCWALClAZDZD&__previous=1"
  },
  "data": [
    {
      "picture": "https://fbcdn-photos-e-a.akamaihd.net/hphotos-ak-xfa1/v/t1.0-0/p130x130/1285_5066979392443_n.png?oh=b37a42ee58654f08af5abbd4f52b1ace&oe=570898E7&__gda__=1461440649_aa94b9ec60f22004675c4a527e8893f",
      "is_hidden": false,
      "likes": {
        "paging": {
          "cursors": {
            "after": "MTU3NzQxODMzNTg0NDcwNQ==",
            "before": "MTU5Mzc1MjA3NDE4ODgwMA=="
          }
        },
        "data": [
          {
            "id": "1593752074188800",
            "name": "Maduri Priyadarshani"
          },
          {
            "id": "427605680763414",
            "name": "Darshi Mashika"
          },
          {
            "id": "599793563453832",
            "name": "Shakeer Nimeshani Shashikala"
          },
          {
            "id": "1577418335844705",
            "name": "Däzlling Jalali Muishu"
          }
        ]
      },
      "from": {
        "category": …
Run Code Online (Sandbox Code Playgroud)

python csv json facebook

5
推荐指数
1
解决办法
746
查看次数

如何生成日期系列以占用Google BiqQuery中缺少的日期?

我想从谷歌大查询表中获取每日销售总额.我使用了以下代码.

select Day(InvoiceDate) date, Sum(InvoiceAmount) sales from test_gmail_com.sales 
where year(InvoiceDate) = Year(current_date()) and
Month(InvoiceDate) = Month(current_date())
group by date order by date
Run Code Online (Sandbox Code Playgroud)

从上面的查询中,它只给出了表中每日销售额的总和.有些日子有可能没有任何销售.对于那种情况,我需要得到日期和总和应该为0.例如,在每个月应该30 0r 31行与销售额之和.示例如下所示.本月的第4天没有销售.所以它的总和应该是0.

date | sales
-----+------
1    |   259
-----+------
2    |   359
-----+------
3    |   45
-----+------
4    |    0
-----+------
5    |  156
Run Code Online (Sandbox Code Playgroud)

是否可以在Big-query中进行?基本上日期列应该是1 - 28/29/30或31st的系列,具体取决于一年中的月份

sql google-bigquery

4
推荐指数
2
解决办法
5419
查看次数

如何在烧瓶应用程序模型中添加带有值(预定义)的下拉列表?

我是烧瓶的新手,我想将下拉列表添加到具有预定义值的表单中(不是从数据库中获取)。我创建了一个模型如下。

class DeliveryDetails(Model):

    _tablename_ = 'deliverydetails'
    id = Column(Integer, primary_key=True)
    customer_name = Column(String(250), nullable=False)
    delivery_type = Column(String(250), nullable=False)
Run Code Online (Sandbox Code Playgroud)

并查看如下。

class DeliveryDetailsView(ModelView, DeleteMixin):
    datamodel = SQLAInterface(models.DeliveryDetails)
    list_columns = ['customer_name','delivery_type']
    search_columns = ['customer_name','delivery_type']
    edit_columns = ['customer_name','delivery_type']
    add_columns = edit_columns
    label_columns = {
        'customer_name': _("Customer Name"),
        'delivery_type': _("Delivery Type") }
Run Code Online (Sandbox Code Playgroud)

我想表明Air, Land, Sea作为Delivery Types在下拉列表中。请让我知道是否可以按照我提到的做?

python sqlalchemy flask superset flask-sqlalchemy

3
推荐指数
1
解决办法
8280
查看次数