]>
Commit | Line | Data |
---|---|---|
db3b92b4 CS |
1 | README 2007/05/31\r |
2 | \r | |
3 | Oniguruma ---- (C) K.Kosako <sndgk393 AT ybb DOT ne DOT jp>\r | |
4 | \r | |
5 | http://www.geocities.jp/kosako3/oniguruma/\r | |
6 | \r | |
7 | Oniguruma is a regular expressions library.\r | |
8 | The characteristics of this library is that different character encoding\r | |
9 | for every regular expression object can be specified.\r | |
10 | \r | |
11 | Supported character encodings:\r | |
12 | \r | |
13 | ASCII, UTF-8, UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE,\r | |
14 | EUC-JP, EUC-TW, EUC-KR, EUC-CN,\r | |
15 | Shift_JIS, Big5, GB18030, KOI8-R, CP1251,\r | |
16 | ISO-8859-1, ISO-8859-2, ISO-8859-3, ISO-8859-4, ISO-8859-5,\r | |
17 | ISO-8859-6, ISO-8859-7, ISO-8859-8, ISO-8859-9, ISO-8859-10,\r | |
18 | ISO-8859-11, ISO-8859-13, ISO-8859-14, ISO-8859-15, ISO-8859-16\r | |
19 | \r | |
20 | * GB18030: contributed by KUBO Takehiro\r | |
21 | * CP1251: contributed by Byte\r | |
22 | ------------------------------------------------------------\r | |
23 | \r | |
24 | License\r | |
25 | \r | |
26 | BSD license.\r | |
27 | \r | |
28 | \r | |
29 | Install\r | |
30 | \r | |
31 | Case 1: Unix and Cygwin platform\r | |
32 | \r | |
33 | 1. ./configure\r | |
34 | 2. make\r | |
35 | 3. make install\r | |
36 | \r | |
37 | * uninstall\r | |
38 | \r | |
39 | make uninstall\r | |
40 | \r | |
41 | * test (ASCII/EUC-JP)\r | |
42 | \r | |
43 | make atest\r | |
44 | \r | |
45 | * configuration check\r | |
46 | \r | |
47 | onig-config --cflags\r | |
48 | onig-config --libs\r | |
49 | onig-config --prefix\r | |
50 | onig-config --exec-prefix\r | |
51 | \r | |
52 | \r | |
53 | \r | |
54 | Case 2: Win32 platform (VC++)\r | |
55 | \r | |
56 | 1. copy win32\Makefile Makefile\r | |
57 | 2. copy win32\config.h config.h\r | |
58 | 3. nmake\r | |
59 | \r | |
60 | onig_s.lib: static link library\r | |
61 | onig.dll: dynamic link library\r | |
62 | \r | |
63 | * test (ASCII/Shift_JIS)\r | |
64 | 4. copy win32\testc.c testc.c\r | |
65 | 5. nmake ctest\r | |
66 | \r | |
67 | \r | |
68 | \r | |
69 | Regular Expressions\r | |
70 | \r | |
71 | See doc/RE (or doc/RE.ja for Japanese).\r | |
72 | \r | |
73 | \r | |
74 | Usage\r | |
75 | \r | |
76 | Include oniguruma.h in your program. (Oniguruma API)\r | |
77 | See doc/API for Oniguruma API.\r | |
78 | \r | |
79 | If you want to disable UChar type (== unsigned char) definition\r | |
80 | in oniguruma.h, define ONIG_ESCAPE_UCHAR_COLLISION and then \r | |
81 | include oniguruma.h.\r | |
82 | \r | |
83 | If you want to disable regex_t type definition in oniguruma.h,\r | |
84 | define ONIG_ESCAPE_REGEX_T_COLLISION and then include oniguruma.h.\r | |
85 | \r | |
86 | Example of the compiling/linking command line in Unix or Cygwin,\r | |
87 | (prefix == /usr/local case)\r | |
88 | \r | |
89 | cc sample.c -L/usr/local/lib -lonig\r | |
90 | \r | |
91 | \r | |
92 | If you want to use static link library(onig_s.lib) in Win32,\r | |
93 | add option -DONIG_EXTERN=extern to C compiler.\r | |
94 | \r | |
95 | \r | |
96 | \r | |
97 | Sample Programs\r | |
98 | \r | |
99 | sample/simple.c example of the minimum (Oniguruma API)\r | |
100 | sample/names.c example of the named group callback.\r | |
101 | sample/encode.c example of some encodings.\r | |
102 | sample/listcap.c example of the capture history.\r | |
103 | sample/posix.c POSIX API sample.\r | |
104 | sample/sql.c example of the variable meta characters.\r | |
105 | (SQL-like pattern matching)\r | |
106 | \r | |
107 | Test Programs\r | |
108 | sample/syntax.c Perl, Java and ASIS syntax test.\r | |
109 | sample/crnl.c --enable-crnl-as-line-terminator test\r | |
110 | \r | |
111 | \r | |
112 | Source Files\r | |
113 | \r | |
114 | oniguruma.h Oniguruma API header file. (public)\r | |
115 | onig-config.in configuration check program template.\r | |
116 | \r | |
117 | regenc.h character encodings framework header file.\r | |
118 | regint.h internal definitions\r | |
119 | regparse.h internal definitions for regparse.c and regcomp.c\r | |
120 | regcomp.c compiling and optimization functions\r | |
121 | regenc.c character encodings framework.\r | |
122 | regerror.c error message function\r | |
123 | regext.c extended API functions. (deluxe version API)\r | |
124 | regexec.c search and match functions\r | |
125 | regparse.c parsing functions.\r | |
126 | regsyntax.c pattern syntax functions and built-in syntax definitions.\r | |
127 | regtrav.c capture history tree data traverse functions.\r | |
128 | regversion.c version info function.\r | |
129 | st.h hash table functions header file\r | |
130 | st.c hash table functions\r | |
131 | \r | |
132 | oniggnu.h GNU regex API header file. (public)\r | |
133 | reggnu.c GNU regex API functions\r | |
134 | \r | |
135 | onigposix.h POSIX API header file. (public)\r | |
136 | regposerr.c POSIX error message function.\r | |
137 | regposix.c POSIX API functions.\r | |
138 | \r | |
139 | enc/mktable.c character type table generator.\r | |
140 | enc/ascii.c ASCII encoding.\r | |
141 | enc/euc_jp.c EUC-JP encoding.\r | |
142 | enc/euc_tw.c EUC-TW encoding.\r | |
143 | enc/euc_kr.c EUC-KR, EUC-CN encoding.\r | |
144 | enc/sjis.c Shift_JIS encoding.\r | |
145 | enc/big5.c Big5 encoding.\r | |
146 | enc/gb18030.c GB18030 encoding.\r | |
147 | enc/koi8.c KOI8 encoding.\r | |
148 | enc/koi8_r.c KOI8-R encoding.\r | |
149 | enc/cp1251.c CP1251 encoding.\r | |
150 | enc/iso8859_1.c ISO-8859-1 encoding. (Latin-1)\r | |
151 | enc/iso8859_2.c ISO-8859-2 encoding. (Latin-2)\r | |
152 | enc/iso8859_3.c ISO-8859-3 encoding. (Latin-3)\r | |
153 | enc/iso8859_4.c ISO-8859-4 encoding. (Latin-4)\r | |
154 | enc/iso8859_5.c ISO-8859-5 encoding. (Cyrillic)\r | |
155 | enc/iso8859_6.c ISO-8859-6 encoding. (Arabic)\r | |
156 | enc/iso8859_7.c ISO-8859-7 encoding. (Greek)\r | |
157 | enc/iso8859_8.c ISO-8859-8 encoding. (Hebrew)\r | |
158 | enc/iso8859_9.c ISO-8859-9 encoding. (Latin-5 or Turkish)\r | |
159 | enc/iso8859_10.c ISO-8859-10 encoding. (Latin-6 or Nordic)\r | |
160 | enc/iso8859_11.c ISO-8859-11 encoding. (Thai)\r | |
161 | enc/iso8859_13.c ISO-8859-13 encoding. (Latin-7 or Baltic Rim)\r | |
162 | enc/iso8859_14.c ISO-8859-14 encoding. (Latin-8 or Celtic)\r | |
163 | enc/iso8859_15.c ISO-8859-15 encoding. (Latin-9 or West European with Euro)\r | |
164 | enc/iso8859_16.c ISO-8859-16 encoding.\r | |
165 | (Latin-10 or South-Eastern European with Euro)\r | |
166 | enc/utf8.c UTF-8 encoding.\r | |
167 | enc/utf16_be.c UTF-16BE encoding.\r | |
168 | enc/utf16_le.c UTF-16LE encoding.\r | |
169 | enc/utf32_be.c UTF-32BE encoding.\r | |
170 | enc/utf32_le.c UTF-32LE encoding.\r | |
171 | enc/unicode.c Unicode information data.\r | |
172 | \r | |
173 | win32/Makefile Makefile for Win32 (VC++)\r | |
174 | win32/config.h config.h for Win32\r | |
175 | \r | |
176 | \r | |
177 | \r | |
178 | ToDo\r | |
179 | \r | |
180 | ? case fold flag: Katakana <-> Hiragana.\r | |
181 | ? add ONIG_OPTION_NOTBOS/NOTEOS. (\A, \z, \Z)\r | |
182 | ?? \X (== \PM\pM*)\r | |
183 | ?? implement syntax behavior ONIG_SYN_CONTEXT_INDEP_ANCHORS.\r | |
184 | ?? transmission stopper. (return ONIG_STOP from match_at())\r | |
185 | \r | |
186 | and I'm thankful to Akinori MUSHA.\r | |
187 | \r | |
188 | \r | |
189 | Mail Address: K.Kosako <sndgk393 AT ybb DOT ne DOT jp>\r |