Posts by Mar*_*cel

Alternative to numpy roll without copying the array

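I am doing something like the code below, and I am not happy with the performance of np.roll(). I sum baseArray and otherArray, where baseArray is rolled by one element in each iteration. I do not need a copy of baseArray when I roll it; I would prefer a view. For example, when I add baseArray to another array after baseArray has been rolled twice, the 2nd element of baseArray is added to the 0th element of otherArray, the 3rd element of baseArray to the 1st element of otherArray, and so on.

I.e. I want to achieve the same result as with np.roll(), but without copying the array.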

import numpy as np
from numpy import random
import cProfile

def profile():
    baseArray = np.zeros(1000000)
    for i in range(1000):
        baseArray= np.roll(baseArray,1)
        otherArray= np.random.rand(1000000)
        baseArray=baseArray+otherArray

cProfile.run('profile()')


Output (note the 3rd line, the roll function):

         9005 function calls in 26.741 seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    5.123    5.123   26.740   26.740 <ipython-input-101-9006a6c0d2e3>:5(profile)
        1    0.001    0.001   26.741   26.741 <string>:1(<module>)
     1000    0.237    0.000    8.966    0.009 numeric.py:1327(roll)
     1000    0.004    0.000    0.005    0.000 numeric.py:476(asanyarray)
        1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects} …
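
One possible way to avoid the per-iteration copy, sketched under the assumption that only the final array is needed: keep the accumulator unrolled, track the total shift as an integer, and add each new array in two in-place slice operations. The names `rolled_accumulate`, `reference` and `chunks` are made up for this illustration, and the single np.roll() at the end still copies once.

```python
import numpy as np

def rolled_accumulate(chunks, n):
    """Same result as roll-then-add in a loop, without rolling each time."""
    acc = np.zeros(n)   # accumulator kept in "unrolled" coordinates
    shift = 0           # total number of rolls applied so far
    for other in chunks:
        shift = (shift + 1) % n
        # Adding `other` to roll(acc, shift) equals adding roll(other, -shift)
        # to acc; the two slices below do that in place on views.
        acc[:n - shift] += other[shift:]
        acc[n - shift:] += other[:shift]
    return np.roll(acc, shift)  # one copy at the very end

def reference(chunks, n):
    """The loop from the question, used here only for comparison."""
    base = np.zeros(n)
    for other in chunks:
        base = np.roll(base, 1)
        base = base + other
    return base

rng = np.random.default_rng(0)
chunks = [rng.random(7) for _ in range(20)]
print(np.allclose(rolled_accumulate(chunks, 7), reference(chunks, 7)))  # True
```

Whether this is actually faster for 1,000,000-element arrays would have to be measured; in the profile above a large share of the time is also spent outside roll().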


python performance numpy

Mar*_*cel

2016 03-10

8
votes

1
answer

3054
views

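How to project whether a field exists

Suppose I have documents with a structure like the one below. I update them with the results of computations, and I want to know whether a result has already been inserted into a document. Say that for each document I run computation 'c' and computation 'd'. Now I want to display a table of all documents that shows whether computation 'd' has already been performed. For this table I do not care about computation 'c'.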

{
"_id":1,
"a":1,
"resultsOfComputation":{
   "c":{large embedded document},
   "d":{large embedded document}
   }
}

{
"_id":2,
"a":1,
"resultsOfComputation":{
   "c":{large embedded document}
   }
}


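I would like to get a result that tells me whether a document contains a specific field. For example, I want to know whether it contains the field "resultsOfComputation.d", regardless of the field's value.

An example result of the query for "resultsOfComputation.d" would be: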

{
"_id":1,
"a":1,
"resultsOfComputation":{
   "d":true
   }
}

{
"_id":2,
"resultsOfComputation":{
   "d":false
   }
}


If "resultsOfComputation.d" is not in the document, the field may also be left undefined, which is fine too:

{
"_id":1,
"a":1,
"resultsOfComputation":{
   "d":true
   }
}

{
"_id":2,
"a":1,
"resultsOfComputation":{}
}


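In general, the idea is to get all root elements of a document, but only true/false/undefined for the selected (one) computation result, because the computation results are large embedded documents.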
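
A minimal sketch of one way this could be expressed, assuming a MongoDB version that has the `$type` aggregation operator (3.4 or later, so possibly newer than what the question was written against) and a pymongo-style pipeline written as Python dicts. The stage is only constructed here, not run against a server. `has_path` is a hypothetical pure-Python stand-in that mirrors the check on in-memory dicts, and the empty dicts stand in for the large embedded documents.

```python
# Stage as it could be passed to pymongo's collection.aggregate(pipeline).
FIELD = "resultsOfComputation.d"
project_stage = {
    "$project": {
        "a": 1,
        # $type yields the string "missing" when the path is absent,
        # so the comparison is true only for documents that have the field.
        FIELD: {"$ne": [{"$type": "$" + FIELD}, "missing"]},
    }
}
pipeline = [project_stage]

def has_path(doc, dotted_path):
    """Pure-Python stand-in for the check above, for in-memory dicts."""
    current = doc
    for key in dotted_path.split("."):
        if not isinstance(current, dict) or key not in current:
            return False
        current = current[key]
    return True

docs = [
    {"_id": 1, "a": 1, "resultsOfComputation": {"c": {}, "d": {}}},
    {"_id": 2, "a": 1, "resultsOfComputation": {"c": {}}},
]
flags = {doc["_id"]: has_path(doc, FIELD) for doc in docs}
print(flags)  # {1: True, 2: False}
```

Other root fields would have to be listed in the `$project` stage explicitly, in the same way as "a".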

mongodb

Mar*_*cel

2016 08-02

5
votes

1
answer

1807
views

IPython notebook: how to set the correct kernel path

Running IPython notebook on Windows 7 64-bit, I get an error when starting a notebook with the Python 2 kernel:

Traceback (most recent call last):
  File "C:\Users\USER1\Anaconda2\lib\site-packages\notebook\base\handlers.py", line 436, in wrapper
    result = yield gen.maybe_future(method(self, *args, **kwargs))
  File "C:\Users\USER1\Anaconda2\lib\site-packages\notebook\services\sessions\handlers.py", line 56, in post
    model = sm.create_session(path=path, kernel_name=kernel_name)
  File "C:\Users\USER1\Anaconda2\lib\site-packages\notebook\services\sessions\sessionmanager.py", line 66, in create_session
    kernel_name=kernel_name)
  File "C:\Users\USER1\Anaconda2\lib\site-packages\notebook\services\kernels\kernelmanager.py", line 84, in start_kernel
    **kwargs)
  File "C:\Users\USER1\Anaconda2\lib\site-packages\jupyter_client\multikernelmanager.py", line 109, in start_kernel
    km.start_kernel(**kwargs)
  File "C:\Users\USER1\Anaconda2\lib\site-packages\jupyter_client\manager.py", line 244, in start_kernel
    **kw)
  File "C:\Users\USER1\Anaconda2\lib\site-packages\jupyter_client\manager.py", line 190, in _launch_kernel
    return launch_kernel(kernel_cmd, **kw)
  File "C:\Users\USER1\Anaconda2\lib\site-packages\jupyter_client\launcher.py", line 115, in launch_kernel
    proc = Popen(cmd, **kwargs)
  File "C:\Users\USER1\Anaconda2\lib\subprocess.py", …
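
The traceback above is cut off, so the actual cause is not visible. Purely as an assumption, if the failing Popen call is due to a kernel spec that points at an interpreter which no longer exists, a check along these lines could confirm it. The helper name `kernel_executable_found` is hypothetical, the path to kernel.json has to be supplied by the reader (`jupyter kernelspec list` prints the kernel spec directories), and the sketch is written for Python 3.

```python
import json
import shutil

def kernel_executable_found(kernel_json_path):
    """Return True if the first entry of the spec's argv can be located."""
    with open(kernel_json_path, encoding="utf-8") as handle:
        spec = json.load(handle)
    argv = spec.get("argv", [])
    # shutil.which resolves bare names via PATH and also accepts full paths.
    return bool(argv) and shutil.which(argv[0]) is not None
```

If the function returns False for the Python 2 kernel's kernel.json, the "argv" entry in that file would be the place to look.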


path ipython anaconda jupyter jupyter-notebook

Mar*_*cel

2016 05-23

2
votes

1
answer

6430
views

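MongoDB: find the document with the maximum value of a specific field for each group (argmax)

After performing an unwind in my aggregate pipeline, I have intermediate results such as: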

[
{_id:1, precision:0.91, recall:0.71, other fields...},
{_id:1, precision:0.71, recall:0.81, other fields...},
{_id:1, precision:0.61, recall:0.91, other fields...},
{_id:2, precision:0.82, recall:0.42, other fields...},
{_id:2, precision:0.72, recall:0.52, other fields...},
{_id:2, precision:0.62, recall:0.62, other fields...}
]


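Now I want to group the documents by _id, then find the document with the maximum recall in each group, and get the recall, precision and _id of that document.

The result would be: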

[
    {_id:1, precisionOfDocWithMaxRecall:0.61, maxRecall:0.91},
    {_id:2, precisionOfDocWithMaxRecall:0.62, maxRecall:0.62}
]


I have managed to get the result using group and max, but without the precision field.
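
A common idiom for this kind of argmax, sketched here as pymongo-style Python dicts and not run against a server: sort by recall in descending order, then group by _id and take the `$first` value of each field. `pipeline_tail` is meant to be appended after the existing unwind stage, and `argmax_recall` is a hypothetical pure-Python equivalent used only to check the expected shape on the sample data.

```python
# Stages that could follow the unwind in collection.aggregate(...).
pipeline_tail = [
    {"$sort": {"recall": -1}},
    {"$group": {
        "_id": "$_id",
        # After the sort, the first document in each group has the max recall.
        "maxRecall": {"$first": "$recall"},
        "precisionOfDocWithMaxRecall": {"$first": "$precision"},
    }},
]

def argmax_recall(docs):
    """In-memory equivalent: keep the highest-recall document per _id."""
    best = {}
    for doc in sorted(docs, key=lambda d: d["recall"], reverse=True):
        best.setdefault(doc["_id"], {
            "_id": doc["_id"],
            "precisionOfDocWithMaxRecall": doc["precision"],
            "maxRecall": doc["recall"],
        })
    return [best[key] for key in sorted(best)]

docs = [
    {"_id": 1, "precision": 0.91, "recall": 0.71},
    {"_id": 1, "precision": 0.71, "recall": 0.81},
    {"_id": 1, "precision": 0.61, "recall": 0.91},
    {"_id": 2, "precision": 0.82, "recall": 0.42},
    {"_id": 2, "precision": 0.72, "recall": 0.52},
    {"_id": 2, "precision": 0.62, "recall": 0.62},
]
result = argmax_recall(docs)
print(result)
```

Any other field of the winning document can be carried along with further `$first` accumulators in the same `$group` stage.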

mongodb mongodb-query aggregation-framework

Mar*_*cel

2017 07-02

2
votes

1
answer

1415
views

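How to dispatch an NGRX action when the URL's query parameters change?

How would you dispatch an ngrx action when the URL's query parameters change? I have a URL with parameters (e.g. http://myapp.my/documents?startdate=2015-04-04&enddate=2015-06-06), and when the user clicks the browser's back (forward) button, Angular navigates to the previous (next) URL, which may have different search parameters. So I need to dispatch a "load documents" action to load the corresponding documents from the database. I can see multiple ways to do this.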

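1. I could bind an ngrx @Effect to the Angular router "navigationend" Observable and check whether the URL matches "/documents". The effect would dispatch the load documents action.
2. I could bind an ngrx @Effect to the ROUTER_NAVIGATION action generated by ngrx/router-store and dispatch the action there. The problem here is that ROUTER_NAVIGATION does not guarantee that the user actually reaches the intended page (e.g. they may be blocked by a route guard).
3. I could dispatch the action in the canActivate() guard of the "documents" route; perhaps other problems are hidden here. (Edit: I tested this and it does not work, because canActivate is called only once. It is not called again when the URL parameters change.)
Is there a better way?

Angular 5, ngrx 4.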

ngrx angular

Mar*_*cel

2018 01-09

2
votes

1
answer

6569
views