小编And*_*ges的帖子

scipy.stats.linregress - 获取截距的 p 值

scipy.stats.linregress返回与斜率相对应的 p 值,但没有截距的 p 值。考虑文档中的以下示例:

>>> from scipy import stats
>>> import numpy as np
>>> x = np.random.random(10)
>>> y = np.random.random(10)
>>> slope, intercept, r_value, p_value, std_err = stats.linregress(x,y)
>>> p_value
0.40795314163864016
Run Code Online (Sandbox Code Playgroud)

根据文档,p-value是“假设检验的两侧 p 值,其零假设是斜率为零。” 我想获得相同的统计数据,但是是截距而不是斜率。

statsmodels.regression.linear_model.OLS返回两个开箱即用的系数的 p 值:

>>> import numpy as np

>>> import statsmodels.api as sm

>>> X = sm.add_constant(x)
>>> model = sm.OLS(y,X)
>>> results = model.fit()
>>> results.pvalues
array([ 0.00297559,  0.40795314])    
Run Code Online (Sandbox Code Playgroud)

仅使用 scipy,如何获得截距的 p 值(0.40795314163864016)?

python statistics scipy statsmodels

8
推荐指数
1
解决办法
5158
查看次数

factory_boy:添加几个依赖对象

我正在使用factory_boy替换Django应用中的装置。我有一个产品模型,其中应该包含许多优惠和商家。

#models.py
class Product(models.Model):
    name = models.CharField()

class Merchant(models.Model):
    product = models.ForeignKey(Product)
    name = models.CharField()

class Offer(models.Model):
    product = models.ForeignKey(Product)
    price = models.DecimalField(max_digits=10, decimal_places=2)
Run Code Online (Sandbox Code Playgroud)

我想要一个工厂来创建具有多个商家和多个要约的产品。

#factories.py
import random
from models import Offer, Merchant, Product

class OfferFactory(factory.django.DjangoModelFactory):
    FACTORY_FOR = Offer

    product = factory.SubFactory(ProductFactory)
    price = random.randrange(0, 50000, 1)/100.0


class MerchantFactory(factory.django.DjangoModelFactory):
    FACTORY_FOR = Merchant

    product = factory.SubFactory(ProductFactory)
    name = factory.Sequence(lambda n: 'Merchant %s' % n)
    url = factory.sequence(lambda n: 'www.merchant{n}.com'.format(n=n))

 class ProductFactory(factory.django.DjangoModelFactory):
    FACTORY_FOR = Product 

    name = "test product"
    offer = factory.RelatedFactory(OfferFactory, …
Run Code Online (Sandbox Code Playgroud)

python django factory-boy

5
推荐指数
1
解决办法
1481
查看次数

使用 mrjob v0.4.4 时,为什么我会收到 [Errno 7] Argument list too long 和 OSError: [Errno 24] Too much open files?

看起来 MapReduce 框架的本质就是处理许多文件。因此,当我收到错误告诉我使用了太多文件时,我怀疑我做错了什么。

如果我使用运行程序和三个目录运行作业inline,它会起作用:

$ python mr_gps_quality.py  /Volumes/Logs/gps/ByCityLogs/city1/0[1-3]/*.log -r inline --no-output --output-dir city1_results/gps_quality/2015/03/
Run Code Online (Sandbox Code Playgroud)

local但是如果我使用运行程序(以及相同的三个目录)运行它,它会失败:

$ python mr_gps_quality.py  /Volumes/Logs/gps/ByCityLogs/city1/0[1-3]/*.log -r local --no-output --output-dir city1_results/gps_quality/2015/03/

[...output clipped...]

> /Users/andrewsturges/sturges/mr/env/bin/python mr_gps_quality.py --step-num=0 --mapper /var/folders/32/5vqk9bjx4c773cpq4pn_r80c0000gn/T/mr_gps_quality.andrewsturges.20150604.170016.046323/input_part-00249 > /var/folders/32/5vqk9bjx4c773cpq4pn_r80c0000gn/T/mr_gps_quality.andrewsturges.20150604.170016.046323/step-k0-mapper_part-00249
Traceback (most recent call last):
  File "mr_gps_quality.py", line 53, in <module>
    MRGPSQuality.run()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/job.py", line 494, in run
    mr_job.execute()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/job.py", line 512, in execute
    super(MRJob, self).execute()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/launch.py", line 147, in execute
    self.run_job()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/launch.py", line 208, in run_job
    runner.run()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/runner.py", line …
Run Code Online (Sandbox Code Playgroud)

python mrjob

2
推荐指数
1
解决办法
2万
查看次数

标签 统计

python ×3

django ×1

factory-boy ×1

mrjob ×1

scipy ×1

statistics ×1

statsmodels ×1