scipy.stats.linregress returns a p-value for the slope, but no p-value for the intercept. Consider the following example from the docs:
>>> from scipy import stats
>>> import numpy as np
>>> x = np.random.random(10)
>>> y = np.random.random(10)
>>> slope, intercept, r_value, p_value, std_err = stats.linregress(x,y)
>>> p_value
0.40795314163864016
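According to the docs, p-value is the "two-sided p-value for a hypothesis test whose null hypothesis is that the slope is zero." I would like the same statistic, but for the intercept instead of the slope.
statsmodels.regression.linear_model.OLS returns p-values for both coefficients out of the box: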
>>> import numpy as np
>>> import statsmodels.api as sm
>>> X = sm.add_constant(x)
>>> model = sm.OLS(y,X)
>>> results = model.fit()
>>> results.pvalues
array([ 0.00297559, 0.40795314])
Using only scipy, how can I get the p-value for the intercept (0.00297559)?
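One possible approach, sketched here under the usual ordinary least squares assumptions, is to derive the intercept's standard error from the slope's standard error and then run the same two-sided t-test that linregress applies to the slope. The helper name `intercept_p_value` and the seeded sample data are illustrative assumptions, not part of scipy.

```python
import numpy as np
from scipy import stats


def intercept_p_value(x, y):
    """Two-sided p-value for H0: intercept == 0 in a simple linear regression.

    Hypothetical helper (not a scipy function); assumes ordinary least squares
    with a single predictor, so the residual degrees of freedom are n - 2.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    res = stats.linregress(x, y)
    # For simple OLS: SE(intercept) = SE(slope) * sqrt(mean(x**2))
    intercept_se = res.stderr * np.sqrt(np.mean(x ** 2))
    t_stat = res.intercept / intercept_se
    # Two-sided tail probability of Student's t with n - 2 degrees of freedom
    return 2.0 * stats.t.sf(abs(t_stat), n - 2)


# Illustrative data; seeded so the result is reproducible
rng = np.random.default_rng(0)
x = rng.random(10)
y = rng.random(10)
print(intercept_p_value(x, y))
```

Applying the same t-test to `res.slope / res.stderr` should reproduce the `pvalue` that linregress reports, which is a quick sanity check on the method. Recent SciPy releases may also expose the intercept's standard error directly on the result object; check the documentation for the installed version.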
I'm using factory_boy to replace fixtures in a Django app. I have a Product model that should have many offers and merchants.
#models.py
class Product(models.Model):
    name = models.CharField()

class Merchant(models.Model):
    product = models.ForeignKey(Product)
    name = models.CharField()

class Offer(models.Model):
    product = models.ForeignKey(Product)
    price = models.DecimalField(max_digits=10, decimal_places=2)
I want a factory that creates a product with multiple merchants and multiple offers.
#factories.py
import random

import factory

from models import Offer, Merchant, Product

class OfferFactory(factory.django.DjangoModelFactory):
    FACTORY_FOR = Offer
    product = factory.SubFactory(ProductFactory)
    price = random.randrange(0, 50000, 1)/100.0

class MerchantFactory(factory.django.DjangoModelFactory):
    FACTORY_FOR = Merchant
    product = factory.SubFactory(ProductFactory)
    name = factory.Sequence(lambda n: 'Merchant %s' % n)
    url = factory.Sequence(lambda n: 'www.merchant{n}.com'.format(n=n))

class ProductFactory(factory.django.DjangoModelFactory):
    FACTORY_FOR = Product
    name = "test product"
    offer = factory.RelatedFactory(OfferFactory, …

It seems that the whole point of the MapReduce framework is to process many files. So when I get an error telling me I'm using too many files, I suspect I'm doing something wrong.
If I run the job with the inline runner and three directories, it works:
$ python mr_gps_quality.py /Volumes/Logs/gps/ByCityLogs/city1/0[1-3]/*.log -r inline --no-output --output-dir city1_results/gps_quality/2015/03/
But if I run it with the local runner (and the same three directories), it fails:
$ python mr_gps_quality.py /Volumes/Logs/gps/ByCityLogs/city1/0[1-3]/*.log -r local --no-output --output-dir city1_results/gps_quality/2015/03/
[...output clipped...]
> /Users/andrewsturges/sturges/mr/env/bin/python mr_gps_quality.py --step-num=0 --mapper /var/folders/32/5vqk9bjx4c773cpq4pn_r80c0000gn/T/mr_gps_quality.andrewsturges.20150604.170016.046323/input_part-00249 > /var/folders/32/5vqk9bjx4c773cpq4pn_r80c0000gn/T/mr_gps_quality.andrewsturges.20150604.170016.046323/step-k0-mapper_part-00249
Traceback (most recent call last):
  File "mr_gps_quality.py", line 53, in <module>
    MRGPSQuality.run()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/job.py", line 494, in run
    mr_job.execute()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/job.py", line 512, in execute
    super(MRJob, self).execute()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/launch.py", line 147, in execute
    self.run_job()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/launch.py", line 208, in run_job
    runner.run()
  File "/Users/andrewsturges/sturges/mr/env/lib/python2.7/site-packages/mrjob/runner.py", line …
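The traceback above is cut off before the actual exception, so the cause is not visible here. Assuming the "too many files" error is the operating system's per-process limit on open file descriptors, a minimal diagnostic sketch is to inspect that limit with Python's standard `resource` module (Unix-only). The helper name `open_file_limits` is hypothetical and not part of mrjob.

```python
import resource


def open_file_limits():
    """Return the (soft, hard) per-process limits on open file descriptors.

    Hypothetical helper for diagnosis only; relies on the Unix-only
    `resource` module from the standard library.
    """
    return resource.getrlimit(resource.RLIMIT_NOFILE)


soft, hard = open_file_limits()
print("soft limit on open files:", soft)
print("hard limit on open files:", hard)
```

If the soft limit is well below the number of input files matched by the glob, that would be consistent with the failure appearing only when many files are processed at once, though this remains an assumption about the truncated error.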