Regex in Notepad++: remove all characters except Japanese characters

Rax*_*aru 3 regex notepad++

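I have been searching for a while but still can't figure it out. I found expressions like ([一-龯])|([ぁ-んァ-ン]), but I don't know how to remove everything except what they match.

What I would like to do is this: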

1. ????????? ohayou gozaimasu – Good morning!
2. ?????  konnichiwa – Hello / Good afternoon!
3. ????  hisashiburi – It’s been a while.
4. ????? jaa mata – See you!
5. ????  o genki de – Take care.
6. o namae wa nan desu ka – What’s your name? ????????? 
7. …desu – I’m……?? 
8. suki desu – I like it. ???? 
9. If you’re not going to be waving your handkerchief in a heartfelt goodbye (where “?????” [sayounara] or “farewell” would be more appropriate), then ????? is a great way to say “see you”! You can also use ????[dewa mata] which is slightly more polite. ???? (see ya) and ????????  (jaa mata ashita ne – see you tomorrow) are also quick and friendly phrases for parting. If you’re in Osaka, then be sure to try some ??? (oosaka ben – Osaka dialect) and say, ???[hona ne]!
10. Example. ????????????
11. 1402  10?               
12. 1405  11?               
13  1408  12?               
14  1411  1?            
15  1414  2?            
16. 1450  3??           
17. 1474  30??   

?????????
?????
????
?????
????
????????
??
????
????? ????? ???? ???? ???????? ??? ???
????????????
10?
11?
12?
1?
2?
3??
30??

Riz*_*man 5

Updated based on the OP's comments, as pointed out by Gurman and KenY-N:

Search for:

[^?-??-??-??-??-?\r\n]+(?![?-??-??-??-??-?\d])|[…“”’\"'–]

And replace with:

empty

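Explanation:

  1. ^ negates the class: it matches any character that is not listed inside the []
  2. [^?-??-??-??-??-?\r\n\b] lists the ranges you do not want to remove, including hiragana, katakana, and so on.
  3. \r\n is needed so that line breaks are not replaced
  4. + matches one or more consecutive characters from that class
  5. (?![?-??-??-??-??-?\d]) is a negative lookahead; it ensures that the preceding match is not immediately followed by a Japanese character or a digit, which is why a number directly before a Japanese character is kept
  6. |[…“”’–] ensures that the remaining punctuation is removed as well.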
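
The same idea can be tried outside Notepad++. Below is a minimal Python sketch of the technique described above (negated class, negative lookahead, extra punctuation alternative). It assumes the kanji, hiragana, and katakana ranges from the question, because the ranges in the search pattern above did not survive; keep_japanese is a hypothetical helper name, and Python's re engine is not the one Notepad++ uses, so treat this as an illustration rather than the exact behavior of the editor.

```python
import re

# Assumption: these ranges come from the question (kanji, hiragana, katakana);
# the ranges in the answer's own pattern may differ.
KEEP = "一-龯ぁ-んァ-ン"

# Stray punctuation that the first alternative can leave behind,
# for example a quote directly before a Japanese character.
PUNCT = "…“”’\"'–"

# Same shape as the search pattern: a negated class followed by a
# negative lookahead, or a single punctuation character.
PATTERN = re.compile(rf"[^{KEEP}\r\n]+(?![{KEEP}\d])|[{PUNCT}]")


def keep_japanese(text: str) -> str:
    """Hypothetical helper: delete every match, like replacing with nothing."""
    return PATTERN.sub("", text)


if __name__ == "__main__":
    sample = "こんにちは konnichiwa – Hello!\n1402  10月"
    print(keep_japanese(sample))
```

In this sketch the lookahead makes the engine back off before a digit that leads up to a Japanese character, so the 10 in front of 月 is kept. One space directly before that number also survives, so trimming leading whitespace afterwards may still be needed.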

Demo