小编Abh*_*his的帖子

在Python数据框中复制Excel计算

你好,我正在尝试=C2*A3^B3在 python 数据框中复制这个 excel 计算。但无法将前一行的计算字段放入我的计算中。

Excel数据

公式

输入数据 :

 ftax        stax
 1.000000000     0   
 0.999959316     3   
 0.999959316     1   
 0.999959316     3   
 0.999959316     3   
 0.999959316     1
Run Code Online (Sandbox Code Playgroud)

如果我将值放入 excel 并应用此公式,我会得到以下结果并希望在 python 中复制相同的结果

=C2*A3^B3在单元格 C3 和 =A2^B2单元格 C2 中

期望的输出:

ftax         stax   cal_field
1.000000000  0      1.000000000
0.999959316  3      0.999877952
0.999959316  1      0.999837272
0.999959316  3      0.999715244
0.999959316  3      0.999593230
0.999959316  1      0.999552562
Run Code Online (Sandbox Code Playgroud)

我尝试过的示例代码

nav_df = pd.DataFrame(nav_cal)
nav_df[cal_filed_1] = nav_df[cal_filed].shift(1).fillna("1")

print (nav_df[cal_filed]*(nav_df[cal_filed_1])**nav_df[9]) #I think this is not the right way
Run Code Online (Sandbox Code Playgroud)

我得到以下输出:

      ftax     stax cal_field
0 …
Run Code Online (Sandbox Code Playgroud)

python excel pandas

2
推荐指数
1
解决办法
282
查看次数

问题:Spark shell中的Scala代码用于从Hbase检索数据

我们试图在Spark shell中执行一个简单的Scala代码来从Hbase中检索数据.Hadoop环境启用了Kerberos,我们确保执行kinit.

调用Spark Shell的步骤:

MASTER=yarn-client

DRIVER_CLASSPATH="/opt/cloudera/parcels/CDH/lib/hbase/lib/*"
DRIVER_LIBRARY_PATH="/opt/cloudera/parcels/CDH/lib/hadoop/lib/native"

spark-shell --driver-class-path "$DRIVER_CLASSPATH" --driver-library-path "$DRIVER_LIBRARY_PATH" --driver-memory 10G --executor-memory 15G --executor-cores 8 --num-executors 3 --master $MASTER
Run Code Online (Sandbox Code Playgroud)

码:

import org.apache.hadoop.fs._
import org.apache.hadoop.hbase._
import org.apache.hadoop.hbase.client._
import org.apache.hadoop.hbase.io._
import org.apache.hadoop.hbase.mapreduce._
import org.apache.hadoop.hbase.util._
import org.apache.spark._

val hc = HBaseConfiguration.create
hc.addResource(new Path("file:///opt/cloudera/parcels/CDH/lib/hbase/conf/hbase-site.xml"))

hc.addResource(new Path("file:///opt/cloudera/parcels/CDH/lib/hbase/conf/core-site.xml"))

hc.set(TableInputFormat.INPUT_TABLE, "poc-customers")
val rdd = sc.newAPIHadoopRDD(hc, classOf[TableInputFormat], classOf[ImmutableBytesWritable], classOf[Result])

rdd.count
Run Code Online (Sandbox Code Playgroud)

以下是ERROR

org.apache.hadoop.hbase.client.RetriesExhaustedException: Can't get the location
        at org.apache.hadoop.hbase.client.RpcRetryingCallerWithReadReplicas.getRegionLocations(RpcRetryingCallerWithReadReplicas.java:308)
        at org.apache.hadoop.hbase.client.ScannerCallableWithReplicas.call(ScannerCallableWithReplicas.java:149)
        at org.apache.hadoop.hbase.client.ScannerCallableWithReplicas.call(ScannerCallableWithReplicas.java:57)
        at org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithoutRetries(RpcRetryingCaller.java:200)
        at org.apache.hadoop.hbase.client.ClientScanner.call(ClientScanner.java:293)
        at org.apache.hadoop.hbase.client.ClientScanner.nextScanner(ClientScanner.java:268)
        at org.apache.hadoop.hbase.client.ClientScanner.initializeScannerInConstruction(ClientScanner.java:140)
        at org.apache.hadoop.hbase.client.ClientScanner.<init>(ClientScanner.java:135)
        at org.apache.hadoop.hbase.client.HTable.getScanner(HTable.java:888)
        at …
Run Code Online (Sandbox Code Playgroud)

hadoop hbase kerberos apache-spark

1
推荐指数
1
解决办法
4563
查看次数

选取6,7,8,13行,然后使用perl将它们合并为一行

我试图从文本文件下面的15间隔之后读取第6,第7,第8和第13行.

REPORT ID    : XZ-12                     ABC SERVICES COMPANY                     RUN DATE: 08/05/2014
PROGRAM      : XYZ                            SYSTEM                              PAGE: 0001
BUSINESS DAY : 08/04/14                 MONTHLY  REPORT                           TIME: 06:28:35

NAME          : FUND11
SRC NUMBER  : 3978
JOB CODE      : 80

    ABC                   DEF               RISK                AZ FUND             ASZ            ADS
    FEE                  FEE                FEE                 FEE                 FEE             FEE
================== ================== ================== ================== ================== ==================
              0.00               0.00               0.00               0.00               0.00               141.00

DATE: 08/05/2014                              

REPORT ID    : XZ-12                    ABC SERVICES COMPANY                      RUN DATE: 08/05/2014
PROGRAM      : XYZ                          SYSTEM                                PAGE: 0002
BUSINESS …
Run Code Online (Sandbox Code Playgroud)

perl

0
推荐指数
1
解决办法
39
查看次数

标签 统计

apache-spark ×1

excel ×1

hadoop ×1

hbase ×1

kerberos ×1

pandas ×1

perl ×1

python ×1