Fra*_*urt 1 python windows nlp summarization
我想pyrouge在Microsoft Windows上使用python包.该软件包未提供有关如何在Microsoft Windows上安装它的任何说明.我怎么能这样做?
以下说明在Windows 7 SP1 x64 Ultimate和python 3.5 x64(Anaconda)上进行了测试.
1)在cmd.exe,运行
pip install pyrouge
Run Code Online (Sandbox Code Playgroud)
2)下载ROUGE-1.5.5.您可以从https://github.com/andersjo/pyrouge/tree/master/tools/ROUGE-1.5.5下载它
3)pyrouge附带一个名为python脚本pyrouge_set_rouge_path(由于某种原因它没有文件扩展名),您需要运行该脚本以指向pyrouge所在的目录ROUGE-1.5.5.您需要找到pyrouge_set_rouge_path,这通常位于python Scripts目录中.
从运行下面的命令cmd.exe,适当地更换目录中pyrouge_set_rouge_path和ROUGE-1.5.5:
python C:\Anaconda\envs\py35\Scripts\pyrouge_set_rouge_path C:\pyrouge-master\tools\ROUGE-1.5.5
Run Code Online (Sandbox Code Playgroud)
4)pyrouge现在应该能够初始化一个Rouge155对象.你可以运行以下python脚本,它应该没有错误:
from pyrouge import Rouge155
r = Rouge155()
Run Code Online (Sandbox Code Playgroud)
5)如果你没有perl.exe,你需要安装它(因为pyrouge只是原始ROUGE脚本的包装,它是用Perl编写的)你可以安装http://strawberryperl.com
确保perl.exe二进制文件位于Path系统环境变量中,例如使用which perl:
要在Path系统环境变量中添加perl :
最后,为了避免这种错误:
一种方法是复制C:\Strawberry\c\bin\*.dll到C:\Strawberry\perl\bin\*.dll.
6)运行时防止出现以下错误消息pyrouge:
Cannot open exception db file for reading: C:\Anaconda\pyrouge-master\tools\ROUGE-1.5.5\data/WordNet-2.0.exc.db
Run Code Online (Sandbox Code Playgroud)
你应该删除\RELEASE-1.5.5\data\WordNet-2.0.exc.db,然后从cmd.exe:
cd RELEASE-1.5.5\data\
perl WordNet-2.0-Exceptions/buildExeptionDB.pl ./WordNet-2.0-Exceptions ./smart_common_words.txt ./WordNet-2.0.exc.db
Run Code Online (Sandbox Code Playgroud)
7)打开C:\Anaconda\envs\py35\Lib\site-packages\pyrouge\Rouge155.py(或安装在任何地方pyrouge),转到功能def evaluate(self, system_id=1, rouge_args=None)(在我写这个答案的时候它在319行),并在command.insert(0, 'perl ')之前添加self.log.info("Running ROUGE with command {}".format(" ".join(command))).(如果你不这样做,你会得到的OSError: [WinError 193] %1 is not a valid Win32 application,这与你未完成上述某些步骤时所获得的错误信息相同).
8)那时pyrouge应该工作正常.不要试图跑python -m pyrouge.test,它是马车.相反,您可以按如下方式测试它:
some_folder:
? rouge.py
?
????model_summaries
? text.A.001.txt
?
????system_summaries
text.001.txt
Run Code Online (Sandbox Code Playgroud)
rouge.py 包含:
from pyrouge import Rouge155
r = Rouge155()
r.system_dir = 'system_summaries'
r.model_dir = 'model_summaries'
r.system_filename_pattern = 'text.(\d+).txt'
r.model_filename_pattern = 'text.[A-Z].#ID#.txt'
output = r.convert_and_evaluate()
print(output)
output_dict = r.output_to_dict(output)
Run Code Online (Sandbox Code Playgroud)
text.A.001.txt 包含:
preprocess my summaries, then run ROUGE
Run Code Online (Sandbox Code Playgroud)
text.001.txt 包含:
I only want to preprocess my summaries and then run ROUGE on my own
Run Code Online (Sandbox Code Playgroud)
运行时输出rouge.py:
2017-10-31 21:55:37,239 [MainThread ] [INFO ] Writing summaries.
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Processing summaries. Saving system files to C:\Users\Francky\AppData\Local\Temp\tmpmh72hoxa\system and model files to C:\Users\Francky\AppData\Local\Temp\tmpmh72hoxa\model.
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Processing files in system_summaries.
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Processing text.001.txt.
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Saved processed files to C:\Users\Francky\AppData\Local\Temp\tmpmh72hoxa\system.
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Processing files in model_summaries.
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Processing text.A.001.txt.
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Saved processed files to C:\Users\Francky\AppData\Local\Temp\tmpmh72hoxa\model.
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Written ROUGE configuration to C:\Users\Francky\AppData\Local\Temp\tmpgx71qygq\rouge_conf.xml
2017-10-31 21:55:37,249 [MainThread ] [INFO ] Running ROUGE with command perl C:\Anaconda\pyrouge-master\tools\ROUGE-1.5.5\ROUGE-1.5.5.pl -e C:\Anaconda\pyrouge-master\tools\ROUGE-1.5.5\data -c 95 -2 -1 -U -r 1000 -n 4 -w 1.2 -a -m C:\Users\Francky\AppData\Local\Temp\tmpgx71qygq\rouge_conf.xml
command: ['C:\\Anaconda\\pyrouge-master\\tools\\ROUGE-1.5.5\\ROUGE-1.5.5.pl', '-e', 'C:\\Anaconda\\pyrouge-master\\tools\\ROUGE-1.5.5\\data', '-c', '95', '-2', '-1', '-U', '-r', '1000', '-n', '4', '-w', '1.2', '-a', '-m', 'C:\\Users\\Francky\\AppData\\Local\\Temp\\tmpgx71qygq\\rouge_conf.xml']
---------------------------------------------
1 ROUGE-1 Average_R: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-1 Average_P: 0.42857 (95%-conf.int. 0.42857 - 0.42857)
1 ROUGE-1 Average_F: 0.60000 (95%-conf.int. 0.60000 - 0.60000)
---------------------------------------------
1 ROUGE-2 Average_R: 0.80000 (95%-conf.int. 0.80000 - 0.80000)
1 ROUGE-2 Average_P: 0.30769 (95%-conf.int. 0.30769 - 0.30769)
1 ROUGE-2 Average_F: 0.44444 (95%-conf.int. 0.44444 - 0.44444)
---------------------------------------------
1 ROUGE-3 Average_R: 0.50000 (95%-conf.int. 0.50000 - 0.50000)
1 ROUGE-3 Average_P: 0.16667 (95%-conf.int. 0.16667 - 0.16667)
1 ROUGE-3 Average_F: 0.25000 (95%-conf.int. 0.25000 - 0.25000)
---------------------------------------------
1 ROUGE-4 Average_R: 0.00000 (95%-conf.int. 0.00000 - 0.00000)
1 ROUGE-4 Average_P: 0.00000 (95%-conf.int. 0.00000 - 0.00000)
1 ROUGE-4 Average_F: 0.00000 (95%-conf.int. 0.00000 - 0.00000)
---------------------------------------------
1 ROUGE-L Average_R: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-L Average_P: 0.42857 (95%-conf.int. 0.42857 - 0.42857)
1 ROUGE-L Average_F: 0.60000 (95%-conf.int. 0.60000 - 0.60000)
---------------------------------------------
1 ROUGE-W-1.2 Average_R: 0.69883 (95%-conf.int. 0.69883 - 0.69883)
1 ROUGE-W-1.2 Average_P: 0.42857 (95%-conf.int. 0.42857 - 0.42857)
1 ROUGE-W-1.2 Average_F: 0.53131 (95%-conf.int. 0.53131 - 0.53131)
---------------------------------------------
1 ROUGE-S* Average_R: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-S* Average_P: 0.16484 (95%-conf.int. 0.16484 - 0.16484)
1 ROUGE-S* Average_F: 0.28303 (95%-conf.int. 0.28303 - 0.28303)
---------------------------------------------
1 ROUGE-SU* Average_R: 1.00000 (95%-conf.int. 1.00000 - 1.00000)
1 ROUGE-SU* Average_P: 0.19231 (95%-conf.int. 0.19231 - 0.19231)
1 ROUGE-SU* Average_F: 0.32258 (95%-conf.int. 0.32258 - 0.32258)
Run Code Online (Sandbox Code Playgroud)
如果您不执行第3步,则运行from pyrouge import Rouge155; r = Rouge155()将收到以下错误消息
Traceback (most recent call last):
File "C:\Users\Franck\Documents\rouge.py", line 8, in <module>
r = Rouge155()
File "C:\Anaconda\envs\py35\lib\site-packages\pyrouge\Rouge155.py", line 88, in __init__
self.__set_rouge_dir(rouge_dir)
File "C:\Anaconda\envs\py35\lib\site-packages\pyrouge\Rouge155.py", line 402, in __set_rouge_dir
self._home_dir = self.__get_rouge_home_dir_from_settings()
File "C:\Anaconda\envs\py35\lib\site-packages\pyrouge\Rouge155.py", line 416, in __get_rouge_home_dir_from_settings
with open(self._settings_file) as f:
FileNotFoundError: [Errno 2] No such file or directory: 'C:\\Users\\Franck\\AppData\\Roaming\\pyrouge\\settings.ini'
Run Code Online (Sandbox Code Playgroud)