pcre函数详细解析

2025-12-31 18:54:30

PCRE是一个NFA正则引擎，不然不能提供完全与Perl一致的正则语法功能。但它同时也实现了DFA，只是满足数学意义上的正则。

1. pcre_compile

原型：
#include <pcre.h>
pcre *pcre_compile(const char *pattern, int options, const char **errptr, int *erroffset, const unsigned char *tableptr);

功能：将一个正则表达式编译成一个内部表示，在匹配多个字符串时，可以加速匹配。其同pcre_compile2功能一样只是缺少一个参数errorcodeptr。

参数：
pattern    正则表达式
options     为0，或者其他参数选项
errptr出错消息
erroffset 出错位置
tableptr   指向一个字符数组的指针，可以设置为空NULL

示例：

代码如下:

L1720 re = pcre_compile((char *)p, options, &error, &erroroffset, tables);

2. pcre_compile2

原型：
#include <pcre.h>
pcre *pcre_compile2(const char *pattern, int options, int *errorcodeptr, const char **errptr, int *erroffset, const unsigned char *tableptr);

功能：将一个正则表达式编译成一个内部表示，在匹配多个字符串时，可以加速匹配。其同pcre_compile功能一样只是多一个参数errorcodeptr。

参数：
pattern    正则表达式
options     为0，或者其他参数选项
errorcodeptr    存放出错码
errptr出错消息
erroffset 出错位置
tableptr   指向一个字符数组的指针，可以设置为空NULL

3. pcre_config

原型：
#include <pcre.h>
int pcre_config(int what, void *where);

功能：查询当前PCRE版本中使用的选项信息。

参数：
what 选项名
where存储结果的位置

示例：

Line1312 (void)pcre_config(PCRE_CONFIG_POSIX_MALLOC_THRESHOLD, &rc);

4. pcre_copy_named_substring

原型：
#include <pcre.h>
int pcre_copy_named_substring(const pcre *code, const char *subject, int *ovector, int stringcount, const char *stringname, char *buffer, int buffersize);

功能：根据名字获取捕获的字串。

参数：
code成功匹配的模式
subject 匹配的串
ovectorpcre_exec() 使用的偏移向量
stringcount   pcre_exec()的返回值
stringname捕获字串的名字
buffer   用来存储的缓冲区
buffersize     缓冲区大小

示例：

代码如下:

Line2730 int rc = pcre_copy_named_substring(re, (char *)bptr, use_offsets,

count, (char *)copynamesptr, copybuffer, sizeof(copybuffer));

5. pcre_copy_substring

原型：
#include <pcre.h>
int pcre_copy_substring(const char *subject, int *ovector, int stringcount, int stringnumber, char *buffer, int buffersize);

功能：根据编号获取捕获的字串。

参数：
code成功匹配的模式
subject 匹配的串
ovectorpcre_exec() 使用的偏移向量
stringcount   pcre_exec()的返回值
stringnumber   捕获字串编号
buffer   用来存储的缓冲区
buffersize     缓冲区大小

示例：

代码如下:

Line2730 int rc = pcre_copy_substring((char *)bptr, use_offsets, count,

i, copybuffer, sizeof(copybuffer));

6. pcre_dfa_exec

原型：
#include <pcre.h>
int pcre_dfa_exec(const pcre *code, const pcre_extra *extra, const char *subject, int length, int startoffset, int options, int *ovector, int ovecsize, int *workspace, int wscount);

功能：使用编译好的模式进行匹配，采用的是一种非传统的方法DFA，只是对匹配串扫描一次（与Perl不兼容）。

参数：
code     编译好的模式
extra 指向一个pcre_extra结构体，可以为NULL
subject    需要匹配的字符串
length匹配的字符串长度（Byte）
startoffset 匹配的开始位置
options     选项位
ovector    指向一个结果的整型数组
ovecsize   数组大小
workspace 一个工作区数组
wscount   数组大小

示例：

代码如下:

Line2730 count = pcre_dfa_exec(re, extra, (char *)bptr, len, start_offset,

options | g_notempty, use_offsets, use_size_offsets, workspace,

sizeof(workspace)/sizeof(int));

7. pcre_copy_substring

原型：
#include <pcre.h>
int pcre_exec(const pcre *code, const pcre_extra *extra, const char *subject, int length, int startoffset, int options, int *ovector, int ovecsize);

功能：使用编译好的模式进行匹配，采用与Perl相似的算法，返回匹配串的偏移位置。。

参数：
code     编译好的模式
extra 指向一个pcre_extra结构体，可以为NULL
subject    需要匹配的字符串
length匹配的字符串长度（Byte）
startoffset 匹配的开始位置
options     选项位
ovector    指向一个结果的整型数组
ovecsize   数组大小

8. pcre_free_substring

原型：
#include <pcre.h>
void pcre_free_substring(const char *stringptr);

功能：释放pcre_get_substring()和pcre_get_named_substring()申请的内存空间。

参数：
stringptr 指向字符串的指针

示例：

代码如下:

Line2730 const char *substring;

int rc = pcre_get_substring((char *)bptr, use_offsets, count,

i, &substring);

……

pcre_free_substring(substring);

9. pcre_free_substring_list

原型：
#include <pcre.h>
void pcre_free_substring_list(const char **stringptr);

功能：释放由pcre_get_substring_list申请的内存空间。

参数：
stringptr 指向字符串数组的指针

示例：

代码如下:

Line2773 const char **stringlist;
int rc = pcre_get_substring_list((char *)bptr, use_offsets, count,
……
pcre_free_substring_list(stringlist);

10. pcre_fullinfo

原型：
#include <pcre.h>
int pcre_fullinfo(const pcre *code, const pcre_extra *extra, int what, void *where);

功能：返回编译出来的模式的信息。

参数：
code 编译好的模式
extra pcre_study()的返回值，或者NULL
what 什么信息
where存储位置

示例：

代码如下:

Line997 if ((rc = pcre_fullinfo(re, study, option, ptr)) < 0)
fprintf(outfile, "Error %d from pcre_fullinfo(%d)/n", rc, option);
}

11. pcre_get_named_substring

原型：
#include <pcre.h>
int pcre_get_named_substring(const pcre *code, const char *subject, int *ovector, int stringcount, const char *stringname, const char **stringptr);

功能：根据编号获取捕获的字串。

参数：
code成功匹配的模式
subject 匹配的串
ovectorpcre_exec() 使用的偏移向量
stringcount pcre_exec()的返回值
stringname捕获字串的名字
stringptr 存放结果的字符串指针

示例：

代码如下:

Line2759 const char *substring;
int rc = pcre_get_named_substring(re, (char *)bptr, use_offsets,
count, (char *)getnamesptr, &substring);

12. pcre_get_stringnumber

原型：
#include <pcre.h>
int pcre_get_stringnumber(const pcre *code, const char *name);

功能：根据命名捕获的名字获取对应的编号。

参数：
code成功匹配的模式
name 捕获名字

13. pcre_get_substring

原型：
#include <pcre.h>
int pcre_get_substring(const char *subject, int *ovector, int stringcount, int stringnumber, const char **stringptr);

功能：获取匹配的子串。

参数：
subject成功匹配的串
ovectorpcre_exec() 使用的偏移向量
stringcount pcre_exec()的返回值
stringnumber 获取的字符串编号
stringptr 字符串指针

14. pcre_get_substring_list

原型：
#include <pcre.h>
int pcre_get_substring_list(const char *subject, int *ovector, int stringcount, const char ***listptr);

功能：获取匹配的所有子串。

参数：
subject成功匹配的串
ovectorpcre_exec() 使用的偏移向量
stringcount pcre_exec()的返回值
listptr 字符串列表的指针

15. pcre_info

原型：
#include <pcre.h>
int pcre_info(const pcre *code, int *optptr, int *firstcharptr);

已过时，使用pcre_fullinfo替代。

16. pcre_maketables

原型：
#include <pcre.h>
const unsigned char *pcre_maketables(void);

功能：生成一个字符表，表中每一个元素的值不大于256，可以用它传给pcre_compile()替换掉内建的字符表。

参数：

示例：
Line2759 tables = pcre_maketables();

17. pcre_refcount

原型：
#include <pcre.h>
int pcre_refcount(pcre *code, int adjust);

功能：编译模式的引用计数。

参数：
code已编译的模式

adjust 调整的引用计数值

18. pcre_study

原型：
#include <pcre.h>
pcre_extra *pcre_study(const pcre *code, int options, const char **errptr);

功能：对编译的模式进行学习，提取可以加速匹配过程的信息。

参数：
code      已编译的模式
options    选项
errptr     出错消息

示例：
Line1797 extra = pcre_study(re, study_options, &error);

19. pcre_version

原型：
#include <pcre.h>
char *pcre_version(void);

功能：返回PCRE的版本信息。
参数：
示例：
Line1384 if (!quiet) fprintf(outfile, "PCRE version %s/n/n", pcre_version());

深度分析正则(pcre)最大回溯/递归限制

今天,Tank问了一个问题, 对于如下的正则: 复制代码代码如下: /<script>.*?<\/script>/i 当要匹配的字符串长度大于100014的时候, 就不会得出正确结果: 复制代码代码如下: $reg = "/<script>.*?<\/script>/is"; $str = "<script>********</script>"; //长度大于100014 $ret = pr
pcre函数详细解析

PCRE是一个NFA正则引擎,不然不能提供完全与Perl一致的正则语法功能.但它同时也实现了DFA,只是满足数学意义上的正则. 1. pcre_compile 原型:#include <pcre.h> pcre *pcre_compile(const char *pattern, int options, const char **errptr, int *erroffset, const unsigned char *tableptr); 功能:将一个正则表达式编译成一个内部表示,在匹配多个
Es6 Generator函数详细解析

ECMAScript 6 (简称 ES6 )作为下一代 JavaScript 语言,将 JavaScript 异步编程带入了一个全新的阶段. Generator函数跟普通函数的写法有非常大的区别: 一是,function关键字与函数名之间有一个星号: 二是,函数体内部使用yield语句,定义不同的内部状态(yield在英语里的意思就是"产出"). 本文重点给大家介绍Es6 Generator函数,具体内容如下所示: /* 一.generator函数的定义 1.Generator 函数是
Python内置函数详细解析

目录 1.abs 2.all 3.any 4.callable 5.dir 6.id 7.locals 和 globals 8.hash 9.sum 10.getattr.setattr.delattr 前言: Python 自带了很多的内置函数,极大地方便了我们的开发,下面就来挑几个内置函数,看看底层是怎么实现的.内置函数位于 Python/bitlinmodule.c 中. 1.abs abs 的功能是取一个整数的绝对值,或者取一个复数的模. static PyObject * builti
C++中 Sort函数详细解析

目录前言一.sort函数调用的两种方式二.sort函数使用场景三.sort函数排序原理四.sort函数使用案例 1.升序排列 2.降序排列实现方式1 实现方式2 3.结构体排序(自定义比较函数) 五.自定义comp函数返回true或false作用前言 sort函数是algorithm库下的一个函数,sort函数是不稳定的,即大小相同的元素在排序后相对顺序可能发生改变,如果某些场景需要保持相同元素间的相对顺序,可使用stable_sort函数,这里不过多介绍. 一.sort函数调用的
JavaScript自定义日期格式化函数详细解析

我们对 JavaScript 扩展其中一个较常的做法便是对 Date.prototype 的扩展.因为我们知道,Date 类只提供了若干获取日期元素的方法,如 getDate(),getMinute()--却没有一个转换为特定字符串的格式化方法.故所以,利用这些细微的方法,加以封装,组合我们想要的日期字符串形式.一般来说,该格式化函数可以定义在 Date 对象的原型身上,也可以独立一个方法写出.定义原型方法的操作如 Date.prototype.format = function(date){-
PHP json_decode函数详细解析

一.函数简介1.此函数有四个参数,一般在使用时会用到前两个,具体的参数可以看手册. 2.第一个参数是json格式的字符串,第二个参数是boolean值(false转化成对象,true转化成数组,默认false),如果转化失败返回null. 二.遇到的问题在项目中调用了java写的一个web服务,返回的数据是"{'stauts':'1','message':'充值成功'}". 程序接到后用json_decode把结果转化成数组,但转化的结果是null,太奇怪了. 看了一下手册,在手册中发
C++中的friend函数详细解析

为什么要使用友元函数在实现类之间数据共享时,减少系统开销,提高效率.如果类A中的函数要访问类B中的成员(例如:智能指针类的实现),那么类A中该函数要是类B的友元函数.具体来说:为了使其他类的成员函数直接访问该类的私有变量.即:允许外面的类或函数去访问类的私有变量和保护变量,从而使两个类共享同一函数. 实际上具体大概有下面两种情况需要使用友元函数:(1)运算符重载的某些场合需要使用友元.(2)两个类要共享数据的时候. 使用友元函数的优缺点优点:能够提高效率,表达简单.清晰. 缺点:友元函数破环
C语言static修饰函数详细解析

在C语言中,static的字面意思很容易把我们导入歧途,其实它的作用有三条.介绍它的第一条也是最重要的一条:隐藏.当我们同时编译多个文件时,所有未加static前缀的全局变量和函数都具有全局可见性.为理解这句话,我举例来说明.我们要同时编译两个源文件,一个是a.c,另一个是main.c. 下面是a.c的内容复制代码代码如下: char a = 'A'; // global variablevoid msg() { printf("Hello\n"); } 下面是main.c的
C++空类默认函数详细解析

定义一个空的C++类,例如 class Empty{} 一个空的class在C++编译器处理过后就不再为空,编译器会自动地为我们声明一些member function,一般编译过去就相当于复制代码代码如下: class Empty{public:Empty(); // 缺省构造函数Empty( const Empty& ); // 拷贝构造函数~Empty(); // 析构函数Empty& operator=( const Empty& ); // 赋值运算符Empty* op
C++中的friend友元函数详细解析

友元函数是可以直接访问类的私有成员的非成员函数.它是定义在类外的普通函数,它不属于任何类,但需要在类的定义中加以声明,声明时只需在友元的名称前加上关键字friend. 我们已知道类具有封装和信息隐藏的特性.只有类的成员函数才能访问类的私有成员,程序中的其他函数是无法访问私有成员的.非成员函数可以访问类中的公有成员,但是如果将数据成员都定义为公有的,这又破坏了隐藏的特性.另外,应该看到在某些情况下,特别是在对某些成员函数多次调用时,由于参数传递,类型检查和安全性检查等都需要时间开销,而影响程序的运

pcre函数详细解析

相关推荐

随机推荐