Chinese Characters / Chinese Punctuation

1. Unicode range for Chinese characters (the CJK Unified Ideographs block, U+4E00 to U+9FFF):

[\u4E00-\u9FFF]
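
As a minimal sketch of how this range might be used, the snippet below anchors the class to check whether a string consists only of characters in U+4E00 to U+9FFF. The helper name `isAllChinese` is hypothetical and not part of the original note.

```javascript
// Hypothetical helper (assumption, not from the original note):
// returns true only when every character of `text` lies in U+4E00..U+9FFF.
function isAllChinese(text) {
  // ^ and $ anchor the match to the whole string; + requires at least one character.
  return /^[\u4E00-\u9FFF]+$/.test(text);
}

console.log(isAllChinese("中文"));    // true
console.log(isAllChinese("中文abc")); // false
console.log(isAllChinese(""));        // false
```

Note that this range covers only the basic block; characters in the CJK extension blocks fall outside it.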

2. Unicode code points for common Chinese punctuation:

[\u3002\uff1b\uff0c\uff1a\u201c\u201d\uff08\uff09\u3001\uff1f\u300a\u300b\uff01\u3010\u3011\uffe5]
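
The class above covers 。；，：“”（）、？《》！【】￥. One possible use, sketched here under the assumption that you want to break text into segments at these marks, is splitting on runs of punctuation. The function name `splitOnChinesePunctuation` is illustrative only.

```javascript
// Same punctuation class as above; the trailing + treats consecutive marks as one separator.
const CHINESE_PUNCTUATION_RUN =
  /[\u3002\uff1b\uff0c\uff1a\u201c\u201d\uff08\uff09\u3001\uff1f\u300a\u300b\uff01\u3010\u3011\uffe5]+/;

// Hypothetical helper: split `text` at Chinese punctuation and drop empty pieces.
function splitOnChinesePunctuation(text) {
  return text.split(CHINESE_PUNCTUATION_RUN).filter((part) => part.length > 0);
}

console.log(splitOnChinesePunctuation("你好，世界！")); // [ '你好', '世界' ]
```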
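
3. Regular expression that globally matches Chinese characters and Chinese punctuation:

// Matches each run of one or more Chinese characters or Chinese punctuation marks.
new RegExp(
  "([\u4E00-\u9FFF]|[\u3002\uff1b\uff0c\uff1a\u201c\u201d\uff08\uff09\u3001\uff1f\u300a\u300b\uff01\u3010\u3011\uffe5])+",
  "g"
)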
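
A short usage sketch, assuming the goal is to pull every Chinese run out of mixed text: with the `g` flag, `String.prototype.match` returns all full matches and ignores the capture group. The names `chinesePattern` and `extractChineseRuns` are hypothetical.

```javascript
// The same pattern as above. The \u escapes in the string literal are resolved
// to the actual characters before RegExp compiles the pattern.
const chinesePattern = new RegExp(
  "([\u4E00-\u9FFF]|[\u3002\uff1b\uff0c\uff1a\u201c\u201d\uff08\uff09\u3001\uff1f\u300a\u300b\uff01\u3010\u3011\uffe5])+",
  "g"
);

// Hypothetical helper: return every run of Chinese characters and punctuation in `text`.
function extractChineseRuns(text) {
  // match() returns null when nothing matches, so fall back to an empty array.
  return text.match(chinesePattern) || [];
}

console.log(extractChineseRuns("Hello，世界！abc 中文")); // [ '，世界！', '中文' ]
console.log(extractChineseRuns("no match here"));         // []
```

Because both alternatives are single-character classes, merging them into one class, `[\u4E00-\u9FFF\u3002...]+`, should match the same text without the capture group.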

 

posted @ 2021-01-27 16:41  龙不吟虎不啸