我想使用pyspark.mllib.stat.Statistics.corr函数来计算两列pyspark.sql.dataframe.DataFrame对象之间的相关性.corr函数需要采取rdd的Vectors对象.如何将一列df['some_name']转换rdd为Vectors.dense对象?
python apache-spark apache-spark-sql pyspark apache-spark-mllib
好吧,伙计们,我今天在接受采访时被问到这个问题,它是这样的:
"告诉一个二叉树是否包含在另一个二叉树中(包含节点的结构和值)"
我想到了以下方法:
将较大的树展平为:
{{{-}a{-}}b{{-}c{-}}}d{{{-}e{{-}f{-}}}g{{{-}h{-}}i{{-}j{-}}}}
Run Code Online (Sandbox Code Playgroud)
(我确实为此编写了代码,{-}意味着空左或右子树,每个子树都包含在{} paranthesis中)
现在对于较小的子树,我们需要匹配这种模式:
{{.*}e{.*}}g{{{.*}h{.*}}i{{.*}j{.*}}}
Run Code Online (Sandbox Code Playgroud)
其中{.*}表示空子树或非空子树.
当时我想,这将是java中一个简单的正则表达式模式匹配问题,但我是竹子.实际上现在我觉得,我刚刚改变了问题(创造了另一个怪物).
是否有一个简单的正则表达式衬里匹配这些模式?我知道可能有其他方法可以解决这个问题,这可能不是最好的方法.我只是想知道这是否可以解决.
我正在尝试使用以下方法在Google云端实例上安装gensim:
pip3安装gensim
这是我尝试导入gensim时的堆栈跟踪:
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/usr/local/lib/python3.4/dist-packages/gensim/__init__.py", line 6, in <module>
from gensim import parsing, matutils, interfaces, corpora, models, similarities, summarization
File "/usr/local/lib/python3.4/dist-packages/gensim/models/__init__.py", line 7, in <module>
from .coherencemodel import CoherenceModel
File "/usr/local/lib/python3.4/dist-packages/gensim/models/coherencemodel.py", line 30, in <module>
from gensim.models.wrappers import LdaVowpalWabbit, LdaMallet
File "/usr/local/lib/python3.4/dist-packages/gensim/models/wrappers/__init__.py", line 5, in <module>
from .ldamallet import LdaMallet
File "/usr/local/lib/python3.4/dist-packages/gensim/models/wrappers/ldamallet.py", line 43, in <module>
from smart_open import smart_open
File "/usr/local/lib/python3.4/dist-packages/smart_open/__init__.py", line 1, in <module>
from .smart_open_lib import …Run Code Online (Sandbox Code Playgroud) 我搜索了很多以解决这个问题在互联网上,但无法找到具体的解决方案,尽管之前详细讨论过这个问题.
查询很简单.我的javascript动态地将iframe添加到网页(显示反馈表单).问题在于,"在回答"之后,现在当用户点击浏览器的后退按钮时,iframe而不是浏览器窗口受到影响,即再次显示问卷.我希望浏览器后退按钮正常运行.
这种行为真的很烦人,我在解决这个问题时遇到了麻烦.
我正在使用Firefox.
期待回复.如果我想提供更多细节,请通知我.
谢谢,
我是Javascript编程的新手,这个人已经清楚地"干净了"我:( ........
以下代码段通过搜索和修改相应的文本节点来突出显示某些文本.使用Jquery功能完成文本节点搜索:
window.addEventListener("load",highlightSummarySentences,false);
function highlightSummarySentences() {
var docName = thisPage;
var numSentences = getCookie(docName+"Num");
var linkSentenceNum = getCookie(docName + 'LinkingSentence');
for(var i=0; i<numSentences; i++) {
var matchMe = getCookie(docName+i);
try {
if (matchMe && i==(linkSentenceNum)) {
highlightText(matchMe, clickedSentenceColour);
}
else if(matchMe){
highlightText(matchMe, summarySentenceColour);
}
} catch (e) {;
}
}
}
Run Code Online (Sandbox Code Playgroud)
for循环运行一次并突出显示相应的文本,然后退出并且页面变为完全白色.在错误控制台中看到以下错误:
错误:未定义getCookie
但我猜它不是getCookie.在上述事件之后,脚本只是拒绝识别任何函数或变量.我不知道可能导致脚本以这种方式运行的原因.我在firefox中开发.
请给我一个提示!如果我应该粘贴更多提供更多信息的代码,请告诉我.
谢谢,
Android初学者在这里.我正在玩ListViews,尝试动态创建它们而不是XML文件.我在代码中观察到以下奇怪的行为.
public class SettingsHolder extends Activity {
@Override
public void onCreate(Bundle savedInstanceState){
super.onCreate(savedInstanceState);
LinearLayout ll = new LinearLayout(this);
ListView lv = new ListView(this);
String[] values = new String[10];
for(int i=0;i<10;i++){
values[i] = ""+i;
}
ArrayAdapter<String> adapter = new ArrayAdapter<String>(this, R.layout.list_item, values);
lv.setAdapter(adapter);
lv.setOnItemClickListener(new OnItemClickListener(){
@Override
public void onItemClick(AdapterView<?> arg0, View arg1, int arg2,
long arg3) {
//Toast.makeText(getBaseContext(), ""+arg2, Toast.LENGTH_SHORT).show();
Log.d("DEBUG", ""+arg2);
}
});
ll.addView(lv);
setContentView(ll);
}
}
Run Code Online (Sandbox Code Playgroud)
基本上我首先创建一个LinearLayout对象,然后将ListView对象作为其子对象之一.我发现这样创建的列表项不可点击.但如果我写
setContentView(lv);
Run Code Online (Sandbox Code Playgroud)
代替
setContentView(ll);
Run Code Online (Sandbox Code Playgroud)
列表项是可点击的.有人可以解释一下吗?如果我必须以后一种方式实现我的课程,如何使列表项可以点击?我不想采用ListActivity方式.
list_item.xml
<?xml version="1.0" encoding="utf-8"?>
<TextView xmlns:android="http://schemas.android.com/apk/res/android"
android:layout_width="fill_parent"
android:layout_height="fill_parent" …Run Code Online (Sandbox Code Playgroud) 我不会发布这个,除非我真的感到沮丧.我试图在一个非常简单的应用程序中使用Hibernate 4.1.1配置C3P0,但我得到的是一个未找到类的异常.我在我的类路径中包含了C3P0特定的jar文件.休眠-c3p0-3.5.2-Final.jar
hibernate.properties:
hibernate.connection.provider_class=org.hibernate.service.jdbc.connections.internal.C3P0ConnectionProvider
hibernate.c3p0.acquire_increment=1
hibernate.c3p0.idle_test_period=180
hibernate.c3p0.max_size=60
hibernate.c3p0.max_statements=0
hibernate.c3p0.min_size=10
Run Code Online (Sandbox Code Playgroud)
堆栈跟踪:
28 Jul, 2012 9:23:08 PM org.hibernate.service.jdbc.connections.internal.ConnectionProviderInitiator instantiateExplicitConnectionProvider
INFO: HHH000130: Instantiating explicit connection provider: org.hibernate.service.jdbc.connections.internal.C3P0ConnectionProvider
Exception in thread "main" org.hibernate.service.spi.ServiceException: Unable to create requested service [org.hibernate.service.jdbc.connections.spi.ConnectionProvider]
at org.hibernate.service.internal.AbstractServiceRegistryImpl.createService(AbstractServiceRegistryImpl.java:186)
at org.hibernate.service.internal.AbstractServiceRegistryImpl.initializeService(AbstractServiceRegistryImpl.java:150)
at org.hibernate.service.internal.AbstractServiceRegistryImpl.getService(AbstractServiceRegistryImpl.java:131)
at org.hibernate.engine.jdbc.internal.JdbcServicesImpl.buildJdbcConnectionAccess(JdbcServicesImpl.java:223)
at org.hibernate.engine.jdbc.internal.JdbcServicesImpl.configure(JdbcServicesImpl.java:89)
at org.hibernate.service.internal.StandardServiceRegistryImpl.configureService(StandardServiceRegistryImpl.java:75)
at org.hibernate.service.internal.AbstractServiceRegistryImpl.initializeService(AbstractServiceRegistryImpl.java:159)
at org.hibernate.service.internal.AbstractServiceRegistryImpl.getService(AbstractServiceRegistryImpl.java:131)
at org.hibernate.cfg.SettingsFactory.buildSettings(SettingsFactory.java:71)
at org.hibernate.cfg.Configuration.buildSettingsInternal(Configuration.java:2277)
at org.hibernate.cfg.Configuration.buildSettings(Configuration.java:2273)
at org.hibernate.cfg.Configuration.buildSessionFactory(Configuration.java:1742)
at org.hibernate.cfg.Configuration.buildSessionFactory(Configuration.java:1782)
at eko.test.hibernate.Hibernate4Test.main(Hibernate4Test.java:17)
Caused by: org.hibernate.HibernateException: Could not instantiate connection provider [org.hibernate.service.jdbc.connections.internal.C3P0ConnectionProvider]
at org.hibernate.service.jdbc.connections.internal.ConnectionProviderInitiator.instantiateExplicitConnectionProvider(ConnectionProviderInitiator.java:192)
at org.hibernate.service.jdbc.connections.internal.ConnectionProviderInitiator.initiateService(ConnectionProviderInitiator.java:114)
at org.hibernate.service.jdbc.connections.internal.ConnectionProviderInitiator.initiateService(ConnectionProviderInitiator.java:54)
at org.hibernate.service.internal.StandardServiceRegistryImpl.initiateService(StandardServiceRegistryImpl.java:69) …Run Code Online (Sandbox Code Playgroud) 一个简单的问题:我们什么时候可以阻止或限制单词的使用?词干对所有nlp进程都有帮助吗?或者在某些应用程序中使用完整形式的单词可能会导致更好的准确性或准确性?
我是一个新手.我创建了一个集群并ssh到主节点.当我尝试将文件从s3:// my-bucket-name /复制到pig中的本地文件:// home/hadoop文件夹时使用:
cp s3://my-bucket-name/path/to/file file://home/hadoop
Run Code Online (Sandbox Code Playgroud)
我收到错误:
2013-06-08 18:59:00,267 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 29 99:意外的内部错误.AWS Access Key ID和Secret Access Key必须分别作为s3 URL的用户名或密码,或者通过设置fs.s3.awsAccessKeyId或fs.s3.awsSecretAccessKey属性(分别).
我甚至不能进入我的s3桶.我没有成功设置AWS_ACCESS_KEY和AWS_SECRET_KEY.此外,我找不到猪的配置文件来设置适当的字段.
有什么帮助吗?
编辑:我尝试使用完整的s3n:// uri在pig中加载文件
grunt> raw_logs = LOAD 's3://XXXXX/input/access_log_1' USING TextLoader a
s (line:chararray);
grunt> illustrate raw_logs;
Run Code Online (Sandbox Code Playgroud)
我收到以下错误:
2013-06-08 19:28:33,342 [main] INFO org.apache.pig.backend.hadoop.executionengi ne.HExecutionEngine - 连接到hadoop文件系统:file:/// 2013-06-08 19:28: 33,404 [main] INFO org.apache.pig.backend.hadoop.executionengi ne.mapReduceLayer.MRCompiler - 文件级联阈值:100乐观?fal se 2013-06-08 19:28:33,404 [main] INFO org.apache.pig.backend.hadoop.executionengi ne.mapReduceLayer.MultiQueryOptimizer - 优化前的MR计划大小:1 2013-06-08 19:28:33,405 [主要] INFO org.apache.pig.backend.hadoop.executionengi ne.mapReduceLayer.MultiQueryOptimizer - 优化后的MR计划大小:1 2013-06-08 19:28:33,405 [main] INFO org.apache.pig.tools. …
我通过URL传递数据,然后使用PHP的$ _GET []函数访问它.如果URL中有某个"#",那么GET请求似乎会截断字符串.例如,
http://example.com/yyy.php?version=0.88&value=ART:C_Sharp_%28programming_language%29@Multi-paradigm_programming_language@Influenced D,F#,Java 5,Nemerle,Vala平台公共语言基础结构许可证CLR专有的常用文件扩展名.cs网站C在Wikibooks上的Sharp编程C#(发音为/siːːrp/ see sharp)是一种多范式编程语言,包括命令式,声明式,功能性,泛型,面向对象(基于类)和面向组件的编程规则. @ 10902
$_GET['value'] 收益:
ART:C_Sharp_(programming_language)@ Multi-paradigm_programming_language @受影响的D,F
有办法避免这种情况吗?我应该从发布到URL的值中删除所有#?
使用XMLHttpRequest()从客户端javascript代码访问URL位置.来自javascript功能encodeURIComponent()的编码字符串是,
ART%3AC_Sharp_%2528programming_language%2529%40Multi-paradigm_programming_language%40Influenced%20D%2C%20F%23%2C%20Java%205%2C%20Nemerle%2C%20Vala%20Platform%20Common%20Language%20Infrastructure%20License%20CLR%20Proprietary% 20Usual%20file%20extensions%20.cs%20Website%20C%20Sharp%20Programming%20AT%20Wikibooks%20C%23%20(发音%20%2F%CB%88si%CB%90%20%CB%88%CA% 83%C9%91rp%2F%20see%20sharp)%图20是%20A%20multi范%20programming%20language%20encompassing%20imperative%2C%20declarative%2C%20functional%2C%20generic%2C%20object为本%20(类基)%2C%20于是%的面向20component-%20programming%20disciplines.%8681
'yyy.php'记录的解码输出:
ART:C_Sharp_%28programming_language%29 @ Multi-paradigm_programming_language @受影响的D,F#,Java 5,Nemerle,Vala平台公共语言基础设施许可证CLR专有常用文件扩展名.cs网站C在Wikibooks C#进行Sharp编程(发音为/ËsiËËʃÉ'rp/ see sharp)是一种多范式编程语言,包括命令式,声明式,功能性,泛型,面向对象(基于类)和面向组件的编程规则.@ 8681
不使用encodeURIComponent()记录输出,应该是预期的结果:
ART:C_Sharp_(programming_language)@ Multi-paradigm_programming_language @Delled D,F#,Java 5,Nemerle,Vala平台公共语言基础设施许可证CLR专有常用文件扩展名.cs网站C在Wikibooks C#进行Sharp编程(发音为/siːːrp/ see sharp)是一种多范式编程语言,包括命令式,声明式,功能性,泛型,面向对象(基于类)和面向组件的编程规则.@ 8681
有没有简单的方法来替换字符串中所有出现的(整个)单词?我目前正在使用它,它不是很优雅:
public static String replace(String input, String toReplace,
String replacement){
if(input==null) throw new NullPointerException();
input = input.replace(" "+toReplace+" ", " "+replacement+" ");
input = input.replaceAll("^"+toReplace+" ", replacement+" ");
input = input.replaceAll(" "+toReplace+"$", " "+replacement);
return input;
}
Run Code Online (Sandbox Code Playgroud)
此外,正则表达式"^"+toReplace+" "不是正则表达式安全.例如:当它可能包含如下字符[或(等.
编辑:
这个代码有什么原因:
public static String replace(String input, String toReplace,
String replacement){
if(input==null) throw new NullPointerException();
input = input.replace(" "+toReplace+" ", " "+replacement+" ");
input = input.replaceAll(Pattern.quote("^"+toReplace+" "), replacement+" ");
input = input.replaceAll(Pattern.quote(" "+toReplace+"$"), " "+replacement); …Run Code Online (Sandbox Code Playgroud) javascript ×3
firefox ×2
java ×2
python ×2
regex ×2
amazon-s3 ×1
android ×1
apache-pig ×1
apache-spark ×1
binary-tree ×1
c3p0 ×1
dom ×1
gensim ×1
hadoop ×1
hdfs ×1
hibernate ×1
iframe ×1
jquery ×1
listview ×1
nlp ×1
php ×1
pip ×1
puzzle ×1
pyspark ×1
replace ×1
stanford-nlp ×1
string ×1
url ×1