mb_regex_encoding() - mb函数(多字节字符串转化库)
mb_regex_encoding()
(PHP 4 >= 4.2.0, PHP 5, PHP 7)
Set/Get character encoding for multibyte regex
说明
mb_regex_encoding([string $encoding= mb_regex_encoding()]): mixedSet/Get character encoding for a multibyte regex.
参数
$encoding$encoding参数为字符编码。如果省略,则使用内部字符编码。
返回值
If$encodingis set, then 成功时返回TRUE
,或者在失败时返回FALSE
。 In this case, the internal character encoding is NOT changed. If$encodingis omitted, then the current character encoding name for a multibyte regex is returned.
更新日志
版本 | 说明 |
---|---|
5.6.0 | Default encoding is changed to UTF-8. It was EUC-JP Previously. |
参见
mb_internal_encoding()
设置/获取内部字符编码mb_ereg()
Regular expression match with multibyte support
Beware, mb_regex_encoding does not support the same set of encodings as listed in mb_list_encodings.php Example:
mb_ereg functionality is provided via Oniguruma RegEx library and not via PCRE. mb_regex_encoding() does only support a subset of encoding names, compared to mb_list_encodings() and mb_encoding_aliases(). Currently the following names are supported (case-insensitive): UCS-4 UCS-4LE UTF-32 UTF-32BE UTF-32LE UTF-16 UTF-16BE UTF-16LE UTF-8 utf8 ASCII US-ASCII EUC-JP eucJP x-euc-jp SJIS eucJP-win SJIS-win CP932 MS932 Windows-31J ISO-8859-1 ISO-8859-2 ISO-8859-3 ISO-8859-4 ISO-8859-5 ISO-8859-6 ISO-8859-7 ISO-8859-8 ISO-8859-9 ISO-8859-10 ISO-8859-13 ISO-8859-14 ISO-8859-15 ISO-8859-16 EUC-CN EUC_CN eucCN gb2312 EUC-TW EUC_TW eucTW BIG-5 CN-BIG5 BIG-FIVE BIGFIVE EUC-KR EUC_KR eucKR KOI8-R KOI8R The list is a mixture of base names and aliases and applies to PHP 5.4.45 (Oniguruma lib v4.7.1), PHP 5.6.31 (v5.9.5), PHP 7.0.22 (v5.9.6) and PHP 7.1.8 (v5.9.6). Be aware of the inconsistency: mb_regex_encoding() accepts for example the base name 'UTF-8' and its only alias 'utf8', but it does not accept aliases 'utf16', 'utf32' or 'latin1'. Additionally note, that the informal name/alias 'latin9' for ISO/IEC 8859-15:1999 (including the Euro sign on 0xA4) is also not known by mb_list_encodings(). It can only be adressed as 'ISO-8859-15' or 'ISO_8859-15' and for mb_regex_encoding() solely as 'ISO-8859-15'.
mb_regex_encoding does not recognize CP1252 or Windows-1252 as valid encodings, although they are in the list generated by mb_list_encodings. ISO-8859-1 (AKA "Latin-1") is supported, but it's not the same as the Windows variety of Latin-1.
To change algo the regex_encodign
Return values vary in setting and getting:
鹏仔微信 15129739599 鹏仔QQ344225443 鹏仔前端 pjxi.com 共享博客 sharedbk.com
免责声明:我们致力于保护作者版权,注重分享,当前被刊用文章因无法核实真实出处,未能及时与作者取得联系,或有版权异议的,请联系管理员,我们会立即处理! 部分文章是来自自研大数据AI进行生成,内容摘自(百度百科,百度知道,头条百科,中国民法典,刑法,牛津词典,新华词典,汉语词典,国家院校,科普平台)等数据,内容仅供学习参考,不准确地方联系删除处理!邮箱:344225443@qq.com)
图片声明:本站部分配图来自网络。本站只作为美观性配图使用,无任何非法侵犯第三方意图,一切解释权归图片著作权方,本站不承担任何责任。如有恶意碰瓷者,必当奉陪到底严惩不贷!
内容声明:本文中引用的各种信息及资料(包括但不限于文字、数据、图表及超链接等)均来源于该信息及资料的相关主体(包括但不限于公司、媒体、协会等机构)的官方网站或公开发表的信息。部分内容参考包括:(百度百科,百度知道,头条百科,中国民法典,刑法,牛津词典,新华词典,汉语词典,国家院校,科普平台)等数据,内容仅供参考使用,不准确地方联系删除处理!本站为非盈利性质站点,本着为中国教育事业出一份力,发布内容不收取任何费用也不接任何广告!)