小编d8a*_*nja的帖子

何时申请(pd.to_numeric)以及何时在python中使用astype(np.float64)？

我有一个名为pandas的DataFrame对象xiv,它有一列int64Volume测量值.

In[]: xiv['Volume'].head(5)
Out[]: 

0    252000
1    484000
2     62000
3    168000
4    232000
Name: Volume, dtype: int64

Run Code Online (Sandbox Code Playgroud)

我已经阅读了其他帖子(比如这个和这个),提出了以下解决方案.但是,当我使用任何一种方法时,它似乎不会改变dtype底层数据:

In[]: xiv['Volume'] = pd.to_numeric(xiv['Volume'])

In[]: xiv['Volume'].dtypes
Out[]: 
dtype('int64')

Run Code Online (Sandbox Code Playgroud)

要么...

In[]: xiv['Volume'] = pd.to_numeric(xiv['Volume'])
Out[]: ###omitted for brevity###

In[]: xiv['Volume'].dtypes
Out[]: 
dtype('int64')

In[]: xiv['Volume'] = xiv['Volume'].apply(pd.to_numeric)

In[]: xiv['Volume'].dtypes
Out[]: 
dtype('int64')

Run Code Online (Sandbox Code Playgroud)

我也尝试制作一个单独的pandas Series并使用上面列出的方法在该系列上并重新分配给x['Volume']obect,这是一个pandas.core.series.Series对象.

但是,我已经使用numpy包的float64类型找到了解决这个问题的方法- 这有效,但我不知道它为什么会有所不同.

In[]: xiv['Volume'] = xiv['Volume'].astype(np.float64)

In[]: xiv['Volume'].dtypes
Out[]: 
dtype('float64') …

Run Code Online (Sandbox Code Playgroud)

python types numpy dataframe pandas

d8a*_*nja

2017 05-23

30
推荐指数

1
解决办法

4万
查看次数

如果我已经有y比例,用逗号(和K？MM？)格式化ggplot2轴标签

我正在尝试格式化ggplot图的y轴标签的成本和收入(以千为单位)和印象数(以百万计)数据.

我的情节从31天前运行到"昨天",并使用该期间的最小值和最大值作为ylim(c(min,max))选项.仅显示成本示例,

library(ggplot2)
library(TTR)

set.seed(1984)

#make series
start <- as.Date('2016-01-01')
end <- Sys.Date()

days <- as.numeric(end - start)

#make cost and moving averages
cost <- rnorm(days, mean = 45400, sd = 11640)
date <- seq.Date(from = start, to = end - 1, by = 'day') 
cost_7 <- SMA(cost, 7)
cost_30 <- SMA(cost, 30)

df <- data.frame(Date = date, Cost = cost, Cost_7 = cost_7, Cost_30 = cost_30)


# set parameters for window
left <- end - 31
right …

Run Code Online (Sandbox Code Playgroud)

formatting r ggplot2

d8a*_*nja

2016 06-09

25
推荐指数

2
解决办法

4万
查看次数

什么类型的对象是R包？

可能是一个非常基本的问题,但朋友和我试图跑str(packge_name),R给我们一个错误.现在我正在看它,我想知道R包是否像.zip文件一样,它是一个对象集合,比如图片和歌曲,但不是图片或歌曲本身.

如果我试图用图像浏览器打开一个图片拉链,它就不会知道该做什么,直到我解压缩它 - 就像我不能打电话str(forecast)但我可以str(ts)在我将预测包加载到我的库中后调用. ..

谁能让我直截了当？

r object install.packages

d8a*_*nja

2015 01-14

22
推荐指数

3
解决办法

1633
查看次数

R Markdown文本中的换行符(不是代码块)

使用tufte_template rmarkdown文件,我试图创建一个新的段落(比如\newthought{},但没有上限.)我使用两个空格,在这里用*表示:

# Introduction

The Tufte-\LaTeX\ [^tufte_latex] document**
**
classes define a style similar to the style Edward Tufte uses in his books...

Run Code Online (Sandbox Code Playgroud)

但得到这个结果:

在此输入图像描述

我也尝试\n过代替第二对空格(**),但是pandoc会抛出错误.

pandoc.exe: Error producing PDF from TeX source
Error: pandoc document conversion failed with error 43

Run Code Online (Sandbox Code Playgroud)

最后,我尝试使用<br>标签,但这似乎没有任何效果 - 它不打印文本或打破PDF.

我想要一个没有缩进的新段落,类似于\newthought {},但没有大写......有没有办法？

使用sessionInfo()更新1:

> sessionInfo()
R version 3.1.2 (2014-10-31)
Platform: x86_64-w64-mingw32/x64 (64-bit)

locale:
[1] LC_COLLATE=English_United States.1252  LC_CTYPE=English_United States.1252   
[3] LC_MONETARY=English_United States.1252 LC_NUMERIC=C                          
[5] LC_TIME=English_United States.1252    

attached base packages:
[1] …

Run Code Online (Sandbox Code Playgroud)

markdown r r-markdown tufte

d8a*_*nja

2015 05-06

12
推荐指数

3
解决办法

5万
查看次数

在Rust中,"阴影"和"可变性"之间有什么区别？

在Rust Book,Variables和Mutability的第3章中,我们对这个主题进行了几次迭代,以演示Rust中变量的默认,不可变行为:

fn main() {
    let x = 5;
    println!("The value of x is {}", x);
    x = 6;
    println!("The value of x is {}", x);
}

Run Code Online (Sandbox Code Playgroud)

哪个输出:

error[E0384]: cannot assign twice to immutable variable `x`
 --> src/main.rs:4:5
  |
2 |     let x = 5;
  |         -
  |         |
  |         first assignment to `x`
  |         help: make this binding mutable: `mut x`
3 |     println!("The value of x is {}", x);
4 |     x = …

Run Code Online (Sandbox Code Playgroud)

variables immutability mutability rust

d8a*_*nja

2018 11-10

9
推荐指数

1
解决办法

670
查看次数

有没有办法在使用DBI和dbGetQuery时超时MySql查询？

我意识到了

dbGetQuery comes with a default implementation that calls dbSendQuery, then dbFetch, ensuring that the result is always freed by dbClearResult.

和

dbClearResult frees all resources (local and remote) associated with a result set. In some cases (e.g., very large result sets) this can be a critical step to avoid exhausting resources (memory, file descriptors, etc.)

但我的团队刚刚经历了一个锁定的表,我们进入了MySQL kill pid,我想知道 - 有没有办法超时使用该DBI包提交的查询？

我正在寻找,找不到相应的

dbGetQuery(conn = connection, 'select stuff from that_table', timeout = 90)

我尝试了这个,并使用和不使用参数集来分析函数,它似乎没有做任何事情; 为什么会这样,如果dbClearResult总是在玩？

mysql timeout r jdbc r-dbi

d8a*_*nja

2018 06-17

8
推荐指数

1
解决办法

789
查看次数

buildspec、appspec 和滚动我自己的 AMI 之间有什么区别？

除了结构之外，我对buildspec.yml和文件之间的区别感到相当困惑。appspec.yml我的构建规范中的阶段命令似乎对部署环境没有太大影响，并且在运行过程（例如安装）时使用哪个 appspec 挂钩之间也存在类似的模糊性。只要该过程发生在需要它的阶段之前，我就看不到阶段之间的差异。（我已阅读规范参考文档；他们大多确认这些阶段的标题是出于其目的，但并不是真正的目的。）

举个例子，为什么每次我必须进行部署时，我不直接使用 Packer 烘焙自己的 AMI，然后启动一个新主机并附加 AMI？

packer aws-code-deploy aws-codepipeline aws-codestar cicd

d8a*_*nja

2022 12-25

7
推荐指数

1
解决办法

3778
查看次数

我可以使用抽象方法来导入 (Python) 熊猫数据的特定于文件的格式吗？

我有一个类FileSet有方法_process_series，它包含了一堆if-elif做块filetag的不同特异性格式pandas.Series：

    elif filetag == "EntityA":
        ps[filetag+"_Id"] = str(ps[filetag+"_Id"]).strip()
        ps[filetag+"_DateOfBirth"] = str(pd.to_datetime(ps[filetag+"_DateOfBirth"]).strftime('%Y-%m-%d')).strip()
        ps[filetag+"_FirstName"] = str(ps[filetag+"_FirstName"]).strip().capitalize()
        ps[filetag+"_LastName"] = str(ps[filetag+"_LastName"]).strip().capitalize()
        ps[filetag+"_Age"] = relativedelta(datetime.today(), datetime.strptime(ps[filetag+"_DateOfBirth"], "%Y-%m-%d")).years
        return ps

Run Code Online (Sandbox Code Playgroud)

我想在类中定义一个抽象format方法，并将这些格式块保存在单独的模块中，这些模块在_process_series为给定的filetag. 原谅伪代码，但类似于：

for tag in filetag:
    from my_formatters import tag+'_formatter' as fmt
    ps = self.format(pandas_series, fmt)
    return ps

Run Code Online (Sandbox Code Playgroud)

该模块将包含格式化块：

# my_formatters.EntityA_formatter
    ps[filetag+"_Id"] = str(ps[filetag+"_Id"]).strip()
    ps[filetag+"_DateOfBirth"] = str(pd.to_datetime(ps[filetag+"_DateOfBirth"]).strftime('%Y-%m-%d')).strip()
    ps[filetag+"_FirstName"] = str(ps[filetag+"_FirstName"]).strip().capitalize()
    ps[filetag+"_LastName"] = str(ps[filetag+"_LastName"]).strip().capitalize()
    ps[filetag+"_Age"] = relativedelta(datetime.today(), datetime.strptime(ps[filetag+"_DateOfBirth"], "%Y-%m-%d")).years …

Run Code Online (Sandbox Code Playgroud)

python format methods python-module pandas

d8a*_*nja

2021 08-09

6
推荐指数

1
解决办法

145
查看次数

pip表示卸载模块"未找到",但是pip列表显示了它们

随后pip list我显示了已安装模块的列表,其中包括scipy,numpy和pandas.

如果我是其中pip install -U...任何一个,它会检查它,看到它是最新的,然后返回提示.但是,如果我pip uninstall pandas或我在列表中看到的任何项目,它说

Can't uninstall numpy. No files were found to uninstall

我通过Enthought的Canopy Stack安装了与Python有关的所有内容.

python uninstall pip package-managers canopy

d8a*_*nja

2014 08-18

5
推荐指数

1
解决办法

1209
查看次数

从 Corr 矩阵返回最大相关性和行名称

我试图data.frame通过使用该cor函数在对象的每一列中找到最大相关性。假设这个对象看起来像

A <- rnorm(100,5,1)
B <- rnorm(100,6,1)
C <- rnorm(100,7,4)
D <- rnorm(100,4,2)
E <- rnorm(100,4,3)


M <- data.frame(A,B,C,D,E)
N <- cor(M)

Run Code Online (Sandbox Code Playgroud)

标签统计

r ×5

python ×3

pandas ×2

aws-code-deploy ×1

aws-codepipeline ×1

aws-codestar ×1

canopy ×1

cicd ×1

correlation ×1

dataframe ×1

format ×1

formatting ×1

ggplot2 ×1

immutability ×1

install.packages ×1

jdbc ×1

markdown ×1

matrix ×1

methods ×1

mutability ×1

mysql ×1

numpy ×1

object ×1

package-managers ×1

packer ×1

pip ×1

python-module ×1

r-dbi ×1

r-markdown ×1

rust ×1

timeout ×1

tufte ×1

types ×1

uninstall ×1

variables ×1

标签 统计

小编d8a_nja的帖子

标签统计