I have the following string:
1) there is a problem 2) There appears to be a bug 3) stuck on start screen. 
I want to get the text that follows 1), 2) and 3). This is what I'm looking for:
['there is a problem', 'There appears to be a bug', 'stuck on start screen']
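I tried using re.split and splitting on \d+, but that didn't give me what I wanted. I want to keep the solution generic, so that if there happens to be a 4) or 5) or more, I can still retrieve the text.
Any help would be greatly appreciated.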
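One possible approach, sketched here under assumptions: split on a run of digits followed by a closing parenthesis rather than on \d+ alone. The helper name split_numbered is hypothetical, and stripping the trailing period is an assumption based on the desired output shown above.

```python
import re

def split_numbered(text):
    # Split on markers such as "1)" or "12)", swallowing surrounding whitespace.
    parts = re.split(r"\s*\d+\)\s*", text)
    # Drop the empty piece before the first marker and trim a trailing period
    # (assumed from the desired output, which has no final ".").
    return [p.strip().rstrip(".").strip() for p in parts if p.strip()]

s = "1) there is a problem 2) There appears to be a bug 3) stuck on start screen. "
print(split_numbered(s))
```

Because the pattern accepts any number of digits, a 4) or 5) item should be picked up the same way.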
I have the following file.txt:
AAAA
BBBB
AAAA
CCCC
EEEE
AAAA
I wrote a script to count the repetitions, sort them from the most repeated to the least repeated, and print them, like this:
AAAA : 3
BBBB : 1
CCCC : 1
EEEE : 1
The script is:
use v5.14;
use strict;
my %map;
chomp(my @chks = <FILE>);
foreach my $load (@chks) {
    $map{$load} += 1;
}
foreach my $key (sort keys %map) {
    say "$key : $map{$key} "
} 
But the output looks like this:
 : 3
 : 1
 : 1
 : 1
Why can't it see the value of $key?
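One plausible cause, which the post does not confirm, is that file.txt uses Windows CRLF line endings: chomp would then leave a trailing "\r" on each key, and the terminal's carriage return would let " : 3" overwrite the key. The Python sketch below illustrates the same counting with both line-ending characters stripped and the result sorted by count; count_lines and the sample list are hypothetical stand-ins for the file.

```python
from collections import Counter

def count_lines(lines):
    # rstrip("\r\n") removes "\n" and any stray "\r" left by CRLF files.
    counts = Counter(line.rstrip("\r\n") for line in lines)
    # Highest count first; ties are broken alphabetically by key.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))

# Assumed sample: the file contents from the question with CRLF endings.
sample = ["AAAA\r\n", "BBBB\r\n", "AAAA\r\n", "CCCC\r\n", "EEEE\r\n", "AAAA\r\n"]
for key, n in count_lines(sample):
    print(f"{key} : {n}")
```

Note that the original script sorts by key rather than by count, so even with clean keys it would only match the desired order by coincidence.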
I have many Firefox Session Manager save files named *.session, and I want to export the URLs and titles from them. I wrote this regular expression:
(?<=entries":\[{"url":"(?<link>.*?(?="))","title":"(?<content>.*?)(?=",")) 
But it doesn't seem to work well. It matches too much.
Part of the file:
[SessionManager v2]
name=jjjjjjjjjjjjjjjjjj
timestamp=1368030038170
autosave=false  count=1/49  screensize=1366x768
{"windows":[{"tabs":[{"entries":[{"url":"http://blog.csdn.net/gisfarmer/article/details/4135975?1357376310","title":"????????C#????? - ?????? - ???? - CSDN.NET","ID":1673113085,"docshellID":36,"referrer":"http://blog.csdn.net/gisfarmer/article/details/4135975","docIdentifier":80,"children":[{"url":"about:blank","ID":1673113086,"docshellID":34,"docIdentifier":81},{"url":"about:blank","ID":1673113087,"docshellID":168,"docIdentifier":82},{"url":"about:blank","ID":1673113088,"docshellID":55,"docIdentifier":83},{"url":"about:blank","ID":1673113089,"docshellID":37,"owner_b64":"CbflmEkNQj+opi5sTsh3UAAAAAAAAAAAwAAAAAAAAEYB3pRy0IA0EdOTmQAQS6D9QDlf4EV9GErbo/2vmMihrxEAAAAC/////wAAAFABAAAAQWh0dHA6Ly9ibG9nLmNzZG4ubmV0L2dpc2Zhcm1lci9hcnRpY2xlL2RldGFpbHMvNDEzNTk3NT8xMzU3Mzc2MzEwAAAAAAAAAAQAAAAHAAAADQAAAAf/////AAAAB/////8AAAAHAAAADQAAABQAAAAtAAAAFAAAACIAAAAUAAAAGwAAAC8AAAAHAAAAL/////8AAAAA/////wAAADcAAAAKAAAAFP////8BAAAAAAAAAAAAAQAAAAAAAA==","docIdentifier":84},{"url":"about:blank","ID":1673113090,"docshellID":31,"owner_b64":"CbflmEkNQj+opi5sTsh3UAAAAAAAAAAAwAAAAAAAAEYB3pRy0IA0EdOTmQAQS6D9QDlf4EV9GErbo/2vmMihrxEAAAAC/////wAAAFABAAAAQWh0dHA6Ly9ibG9nLmNzZG4ubmV0L2dpc2Zhcm1lci9hcnRpY2xlL2RldGFpbHMvNDEzNTk3NT8xMzU3Mzc2MzEwAAAAAAAAAAQAAAAHAAAADQAAAAf/////AAAAB/////8AAAAHAAAADQAAABQAAAAtAAAAFAAAACIAAAAUAAAAGwAAAC8AAAAHAAAAL/////8AAAAA/////wAAADcAAAAKAAAAFP////8BAAAAAAAAAAAAAQAAAAAAAA==","docIdentifier":85},{"url":"about:blank","ID":1673113091,"docshellID":63,"owner_b64":"CbflmEkNQj+opi5sTsh3UAAAAAAAAAAAwAAAAAAAAEYB3pRy0IA0EdOTmQAQS6D9QDlf4EV9GErbo/2vmMihrxEAAAAC/////wAAAFABAAAAQWh0dHA6Ly9ibG9nLmNzZG4ubmV0L2dpc2Zhcm1lci9hcnRpY2xlL2RldGFpbHMvNDEzNTk3NT8xMzU3Mzc2MzEwAAAAAAAAAAQAAAAHAAAADQAAAAf/////AAAAB/////8AAAAHAAAADQAAABQAAAAtAAAAFAAAACIAAAAUAAAAGwAAAC8AAAAHAAAAL/////8AAAAA/////wAAADcAAAAKAAAAFP////8BAAAAAAAAAAAAAQAAAAAAAA==","docIdentifier":86},{"url":"about:blank","ID":1673113092,"docshellID":22,"owner_b64":"CbflmEkNQj+opi5sTsh3UAAAAAAAAAAAwAAAAAAAAEYB3pRy0IA0EdOTmQAQS6D9QDlf4EV9GErbo/
2vmMihrxEAAAAC/////wAAAFABAAAAQWh0dHA6Ly9ibG9nLmNzZG4ubmV0L2dpc2Zhcm1lci9hcnRpY2xlL2RldGFpbHMvNDEzNTk3NT8xMzU3Mzc2MzEwAAAAAAAAAAQAAAAHAAAADQAAAAf/////AAAAB/////8AAAAHAAAADQAAABQAAAAtAAAAFAAAACIAAAAUAAAAGwAAAC8AAAAHAAAAL/////8AAAAA/////wAAADcAAAAKAAAAFP////8BAAAAAAAAAAAAAQAAAAAAAA==","docIdentifier":87},{"url":"about:blank","ID":1673113093,"docshellID":118,"owner_b64":"CbflmEkNQj+opi5sTsh3UAAAAAAAAAAAwAAAAAAAAEYB3pRy0IA0EdOTmQAQS6D9QDlf4EV9GErbo/2vmMihrxEAAAAC/////wAAAFABAAAAQWh0dHA6Ly9ibG9nLmNzZG4ubmV0L2dpc2Zhcm1lci9hcnRpY2xlL2RldGFpbHMvNDEzNTk3NT8xMzU3Mzc2MzEwAAAAAAAAAAQAAAAHAAAADQAAAAf/////AAAAB/////8AAAAHAAAADQAAABQAAAAtAAAAFAAAACIAAAAUAAAAGwAAAC8AAAAHAAAAL/////8AAAAA/////wAAADcAAAAKAAAAFP////8BAAAAAAAAAAAAAQAAAAAAAA==","docIdentifier":88},{"url":"about:blank","ID":1673113094,"docshellID":59,"owner_b64":"CbflmEkNQj+opi5sTsh3UAAAAAAAAAAAwAAAAAAAAEYB3pRy0IA0EdOTmQAQS6D9QDlf4EV9GErbo/2vmMihrxEAAAAC/////wAAAFABAAAAQWh0dHA6Ly9ibG9nLmNzZG4ubmV0L2dpc2Zhcm1lci9hcnRpY2xlL2RldGFpbHMvNDEzNTk3NT8xMzU3Mzc2MzEwAAAAAAAAAAQAAAAHAAAADQAAAAf/////AAAAB/////8AAAAHAAAADQAAABQAAAAtAAAAFAAAACIAAAAUAAAAGwAAAC8AAAAHAAAAL/////8AAAAA/////wAAADcAAAAKAAAAFP////8BAAAAAAAAAAAAAQAAAAAAAA==","docIdentifier":89},{"url":"about:blank","ID":1673113095,"docshellID":137,"owner_b64":"CbflmEkNQj+opi5sTsh3UAAAAAAAAAAAwAAAAAAAAEYB3pRy0IA0EdOTmQAQS6D9QDlf4EV9GErbo/2vmMihrxEAAAAC/////wAAAFABAAAAQWh0dHA6Ly9ibG9nLmNzZG4ubmV0L2dpc2Zhcm1lci9hcnRpY2xlL2RldGFpbHMvNDEzNTk3NT8xMzU3Mzc2MzEwAAAAAAAAAAQAAAAHAAAADQAAAAf/////AAAAB/////8AAAAHAAAADQAAABQAAAAtAAAAFAAAACIAAAAUAAAAGwAAAC8AAAAHAAAAL/////8AAAAA/////wAAADcAAAAKAAAAFP////8BAAAAAAAAAAAAAQAAAAAAAA==","docIdentifier":90},{"url":"about:blank","ID":1673113096,"docshellID":254,"owner_b64":"CbflmEkNQj+opi5sTsh3UAAAAAAAAAAAwAAAAAAAAEYB3pRy0IA0EdOTmQAQS6D9QDlf4EV9GErbo/2vmMihrxEAAAAC/////wAAAFABAAAAQW
And the result:

Can anybody help?
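Since the payload after the header lines looks like JSON, an alternative to the regex is to parse it and walk the structure. This is a sketch of that swapped-in technique, not a fix for the pattern above. It assumes the JSON sits on a single line starting with "{", and it uses a small made-up sample because the excerpt in the post is truncated; extract_entries is a hypothetical helper.

```python
import json

def extract_entries(session_text):
    # Assumption: the JSON payload is the first line that starts with "{".
    for line in session_text.splitlines():
        if line.startswith("{"):
            data = json.loads(line)
            break
    else:
        return []
    results = []
    for window in data.get("windows", []):
        for tab in window.get("tabs", []):
            # Only top-level entries are read; nested "children" are skipped.
            for entry in tab.get("entries", []):
                results.append((entry.get("url"), entry.get("title")))
    return results

# Hypothetical, shortened stand-in for a *.session file.
sample = "\n".join([
    "[SessionManager v2]",
    "name=example",
    '{"windows":[{"tabs":[{"entries":[{"url":"http://example.com/",'
    '"title":"Example","children":[{"url":"about:blank"}]}]}]}]}',
])
print(extract_entries(sample))
```

Walking the parsed structure avoids the over-matching problem, because the about:blank children never reach the result list.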
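I'm trying to write a regular expression to search a text for specific strings such as UE-42F5070 with the preg_match function. I can't work out a regex pattern for "UE-##F####", where # is a digit, that I can use with preg_match to find them in the text.
Example strings: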
UE-32F5070
UE-50F8000
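A minimal sketch of such a pattern, shown with Python's re module rather than preg_match: two digits, a literal F, then four digits, bounded by \b so that longer or shorter codes are not picked up. The sample text is made up for illustration; the same pattern syntax is expected to carry over to PCRE, though that is not tested here.

```python
import re

# "UE-", two digits, "F", four digits, with word boundaries on both sides.
PATTERN = re.compile(r"\bUE-\d{2}F\d{4}\b")

# Hypothetical text containing two valid codes and one malformed code.
text = "Models in stock: UE-32F5070, UE-50F8000 and UE-4F123."
print(PATTERN.findall(text))
```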
I'm rephrasing my question to clear up the confusion!
I want to match if a string contains certain letters, so I use a character class:
[ACD]
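It works perfectly!
But I want to match only if the string contains two or more occurrences, either the same letter repeated or two different letters from the set.
For example:
[AKL] should match: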
ABCVL
AAGHF
KKUI
AKL
But it should not match the following:
ABCD
KHID
LOVE
Because those contain only one occurrence!
That's why I tried using:
[ACD]{2,}
But it doesn't work; it's probably not the right regex. Could a regex guru help me solve this puzzle?
Thanks
PS: I'll be using this in MySQL, so a different approach is welcome too! But I'd like to use a regex for a smarter and shorter query!
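The quantifier {2,} only matches letters from the class that sit next to each other, which may be why the attempt fails on strings like ABCVL. A sketch of one alternative is to allow anything between two members of the class. It is shown in Python for illustration; has_two is a hypothetical helper, and the pattern would presumably also work with MySQL's REGEXP operator, which is not tested here.

```python
import re

# One letter from the set, anything in between, then another letter from the set.
TWO_FROM_SET = re.compile(r"[AKL].*[AKL]")

def has_two(s):
    return TWO_FROM_SET.search(s) is not None

should_match = ["ABCVL", "AAGHF", "KKUI", "AKL"]
should_not_match = ["ABCD", "KHID", "LOVE"]
print([has_two(s) for s in should_match])
print([has_two(s) for s in should_not_match])
```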
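I've been using a regex replace to remove all punctuation from a text box value and to collapse the spaces left behind by the removal. It works great, except I've realized I need it to leave . and ? in place. This is what I'm currently using. How do I skip those two characters but still remove all the others?
Script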
var special = special.replace(/[^\w\s]|_/g, "").replace(/\s+/g, " ");
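One way to keep . and ? is to add them to the negated character class, so they are no longer among the characters being removed. The sketch below shows the idea in Python; the function name clean is hypothetical, and re.ASCII is used as an assumption to keep \w limited to ASCII as in the JavaScript pattern. The JavaScript equivalent would presumably be /[^\w\s.?]|_/g.

```python
import re

def clean(value):
    # Remove everything that is not a word character, whitespace, "." or "?",
    # and also remove underscores (which \w would otherwise keep).
    stripped = re.sub(r"[^\w\s.?]|_", "", value, flags=re.ASCII)
    # Collapse runs of whitespace left behind by the removal.
    return re.sub(r"\s+", " ", stripped)

print(clean("a, b!  c? d."))
```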
I have the following string:
#key1=value1|#key2=value2|#key3=value3
I'm using this regex to match the keys and values:
/#(.*)\=(.*)[\|\v\h|]/gU
It works for key1 and key2, but it doesn't capture key3.
Demo: https://regex101.com/r/eU6lV8/2
Any idea how to capture key3?
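A likely reason is that the pattern requires a delimiter character after each value, and the last pair has none. A sketch of one alternative is to describe the key and value by what they cannot contain, so no trailing delimiter is needed. It is shown in Python; the assumption is that keys contain neither "=" nor "|" and values contain no "|".

```python
import re

s = "#key1=value1|#key2=value2|#key3=value3"

# Key: one or more characters other than "=" and "|".
# Value: everything up to the next "|" or the end of the string.
pairs = re.findall(r"#([^=|]+)=([^|]*)", s)
print(pairs)
```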
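I'm trying to extract a certain result from a Blogger response. I want to find the name of my blog. How would I use a regex for something like this? I've tried Googling my problem, but unfortunately none of the answers helped in my case.
So my response looks like this: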
\\x22http://emyblog.blogspot.com/
So it always starts with \\x22http:// and ends with .blogspot.com/
I've tried the following on the response:
regEx = re.findall(b"""\x22http://(.*)\.blogspot\.com""", r)
But unfortunately it returns an empty list. Any ideas on how to fix this?
Thanks,
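In a non-raw bytes literal, Python turns \x22 into the single byte 0x22 (a double quote), so the pattern above searches for a quote character. If the response really contains the literal characters shown in the sample, which is an assumption here, a raw bytes pattern that matches the backslashes explicitly may work. The variable names below are illustrative.

```python
import re

# Assumption: the response holds literal backslashes followed by "x22",
# exactly as printed in the sample, not an actual double-quote byte.
response = rb"\\x22http://emyblog.blogspot.com/"

# "\\+" in a raw pattern matches one or more literal backslashes;
# "(.*?)" captures lazily up to ".blogspot.com".
pattern = re.compile(rb"\\+x22http://(.*?)\.blogspot\.com")
names = pattern.findall(response)
print(names)
```

The result is a list of bytes objects, so a call such as names[0].decode() would be needed to get a str.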
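So, I have a string made up of several items separated by "+" or "-". The string contains 5 elements: typeCode, versionCode, lengthCode, partCode, and the last part is the data. The length of each element needs to be variable, so typeCode could have a length of 2 but could also have a length of 5. The total string has a maximum length of 30 characters, but it can be shorter depending on the length of the data at the end. When each part has a fixed length (I have been using 2), I can split it, but I want it to be flexible rather than hard-coded.
I want to split each string into its 5 elements as an array. As the examples show, the separator between elements can be "+" or "-". I don't want the separators included in any element. There is no separator character before the first element or after the last one.
I'd also like to be able to determine whether the last separator is "+" or "-" and have it as an element in the array.
I believe this can be done with String.split(separator), where separator is a string or a RegEx in JavaScript, but I'm not sure. I don't know RegEx.
Examples of input and desired output: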
01+03+03+00-3f2aec1f6b088a1c
= ["01", "03", "03", "00", "-", "3f2aec1f6b088a1c"]
01+03+03+01+acaa4ece96da0e31
= ["01", "03", "03", "01", "+", "acaa4ece96da0e31"]
011+031+03+01+acaa4ece96da0e31
= ["011", "031", "03", "01", "+", "acaa4ece96da0e31"]
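The splitting described above can be sketched with a regex split whose separator is in a capturing group, so the separators come back alongside the elements. The sketch is in Python for illustration; split_codes is a hypothetical name, and limiting the split to four separators is an assumption so that the data part is never split. JavaScript's String.prototype.split also returns captured separators when given a regex with a capturing group.

```python
import re

def split_codes(s):
    # The capturing group keeps each "+" or "-" in the result list.
    # maxsplit=4 stops after the four separators between the five elements.
    tokens = re.split(r"\s*([+-])\s*", s.strip(), maxsplit=4)
    elements = tokens[0::2]    # typeCode, versionCode, lengthCode, partCode, data
    separators = tokens[1::2]  # the separators found between them
    if not separators:
        return elements
    # Insert only the last separator, just before the data element.
    return elements[:-1] + [separators[-1], elements[-1]]

print(split_codes("01+03+03+00-3f2aec1f6b088a1c"))
print(split_codes("011+031+03+01+acaa4ece96da0e31"))
```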
Every occurrence of a specific letter, say 'A', needs to be replaced inside all parentheses.
For example,
A. (AbbAAbb) .A. (bbAbbAbA) .A. (bbbbAbbbb)
I want to replace every 'A' inside the parentheses with '', ending up like this:
A. (bbbb) .A. (bbbbb) .A. (bbbbbbbb)
Is there any way to do this with a regex alone?
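One regex-only sketch uses a lookahead: an 'A' counts as being inside parentheses if a ")" follows it before any other parenthesis. This assumes the parentheses are balanced and not nested. Python is used here for illustration.

```python
import re

s = "A. (AbbAAbb) .A. (bbAbbAbA) .A. (bbbbAbbbb)"

# Match "A" only when the next parenthesis after it is a closing one.
result = re.sub(r"A(?=[^()]*\))", "", s)
print(result)
```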
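I have a compute instance running an Apache server that I use to display a dashboard. I want to embed the dashboard in my website using an iframe. But the Compute Engine default IP address is served over HTTP rather than HTTPS.
Please let me know how I can change it to HTTPS?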