duerst, Martin Dürst

Hash Date Message
1b79b5be82ac6baa5be2c4483ab44143493069d5 2007-12-20 Thu Dec 20 17:15:15 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
968eb8beac192fcbfad25151879d505911716e99 2007-12-21 Fri Dec 21 16:25:25 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
5ad8c5566d77be1d7ad37be2a9a2ce43671c5577 2007-12-22 Sat Dec 22 15:45:45 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
54f273406969c3a47671bac197945d839492a3e0 2007-12-22 Sat Dec 22 15:54:54 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
a95ae9619f4e3e831d50de1c545bc226f70c4ef0 2007-12-22 Sat Dec 22 15:54:54 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
4ef8764fef56fbe6a1a0e6da0ac62889f8893186 2007-12-23 Sun Dec 23 10:23:23 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
9c7718ac6b7b91d2613a02b8630b2916abf67f2f 2007-12-24 Mon Dec 24 09:45:45 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
e7ac333ba841e774911eac23ffb680ab3de7a993 2007-12-25 Tue Dec 25 12:32:32 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
793e9423cd4c4c679959b7dd011040b5aba2d169 2007-12-28 Fri Dec 28 01:55:04 2007 Martin Duerst <duerst@it.aoyama.ac.jp>
5f31c7b54819917b41bcb06ac7761caf2c94da81 2008-01-14 Mon Jan 14 10:45:45 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
3d0c7bea4d2ad108889d0c4d81d41c4ff03f2a77 2008-01-20 Sun Jan 20 15:08:08 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
1e75c881da205e185b2bbadf35a68d509414654e 2008-01-20 Sun Jan 20 15:08:08 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
a9b15a4e0cf2f26e05cf0c6761daf7f04f2cf196 2008-01-20 Sun Jan 20 20:00:20 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
38321fc0ebe2c434be24f67703f5b9f4096ee4b7 2008-01-21 Mon Jan 21 19:42:42 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
ef3fdbca15c0883d9ed1987b153a32c5ffeaf8ff 2008-01-22 Tue Jan 22 17:52:52 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
6d5ef97a32f74917efaa53049815873c4ae00aa1 2008-02-21 Thu Feb 21 17:15:15 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
ad3577b59b7ba2b3f533bfcb697492fc9f56762f 2008-02-25 Mon Feb 25 10:41:41 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
08631278ada7a6fd2bafb3ab0f0447b1f6d58790 2008-03-05 Web Mar 5 17:43:43 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
2e7815dd8000dc4d3ef7f2443bf5fd045812ee9e 2008-03-16 Sun Mar 16 18:07:07 2008 Martin Duerst <duerst@it.aoyama.ac.jp>
ba3fe885d5b5f8d1b8be43e11a9c484f6831bb9c 2008-07-22 * test/ruby/test_transcode.rb: added two comments
3e53486295797ef5a2df6a4d684381d6f7d4af66 2008-07-25 * test/ruby/test_transcode.rb: refactoring/cleanup of test_iso_2022_jp(_1)
0469c8d95b93e131814c9a4014eac05bc9570d98 2008-07-31 test/ruby/test_transcode.rb: added test_shift_jis (contributed by Yoshihiro Kambayashi) and test_public_review_issue_121 (see http://www.unicode.org/review/pr-121.html)
5dd5311fdf839f3abbe18cef21a5f16c060cef9e 2008-08-22 * test/ruby/test_transcode.rb: test_shift_jis: fixed comment strings (see r18291)
c5e46ef397ce68fda7cc8f1c0362f663846ef383 2008-08-26 * test/ruby/test_transcode.rb: added test_euc_jp (contributed by Yoshihiro Kambayashi)
48a303c0277a8fd7c90fe0fd1cb4f908ab138e03 2008-10-14 * enc/trans/single_byte.trans: added windows-1252
da1ffe490d589fddcca75f410634b73052668443 2008-10-18 * transcode.c: in transcode_search_path, elimintated a warning on cygwin about pathlen potentially not being initialized
7adbfbb7937f52a9df8792048882a93db466e52e 2008-10-18 * tool/transcode-tblgen.rb: added set_valid_byte_pattern to reduce coupling between table generation script and specific encodings.
45e68487416bad94261f1a022e660d28b4211e5e 2008-10-18 ChangeLog: moved entries back and corrected date
b014f1bc021702ee2800752aa37bfe7848118f2a 2008-10-19 * enc/trans/single_byte.trans: adding WINDOWS-wwww encodings (wwww = 874/1250/1251/1253/1254/1255/1256/1257) (contributed by Yoshihiro Kambayashi)
6fd14ccae523c3fab7f202664bb3ef0125e80313 2008-10-30 * enc/trans/single_byte.trans: refactoring to make it easier to add more transcodings (with Yoshihiro Kambayashi)
d37df9fb13bd2cf46ddd0cafb3bfe1885ac809e9 2008-10-31 * enc/trans/single_byte.trans, maccroatioan-tbl.rb, maccyrillic-tbl.rb, maciceland-tbl.rb: new single-byte transcodings (from Yoshihiro Kambayashi)
831e80438840a6665f99a7e27e7c978bd035a306 2008-11-11 * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, macromania-tbl.rb, macturkish-tbl.rb, macukraine-tbl.rb, ibm437-tbl.rb, ibm852-tbl.rb, ibm855-tbl.rb, ibm857-tbl.rb, ibm860-tbl.rb, ibm861-tbl.rb, ibm862-tbl.rb, ibm863-tbl.rb, ibm865-tbl.rb, ibm866-tbl.rb, ibm869-tbl.rb, ibm775-tbl.rb: new single-byte transcodings (from Yoshihiro Kambayashi)
0d93d8125e0ae23599ea6d562d70d235171afc9c 2008-11-11 * test/ruby/test_transcode.rb: unnecessary setup method (setup_really_needed?) removed
a28fdecda7bfd3f5a5013fc9067c676d4c0ffb35 2008-12-09 * enc/trans/single_byte.trans, cp850-tbl.rb, cp852-tbl.rb, cp855-tbl.rb, koi8-r-tbl.rb, koi8-u-tbl.rb, tis-620-tbl.rb: new single-byte transcodings (from Yoshihiro Kambayashi)
bf6c750c350b021578c3c446407315c404d6652b 2008-12-30 * transcode.c: Minor fixes and tweaks in documentation.
3bc30f0b73d917360b0a550eafbe6894fbf0b334 2009-01-04 * enc/trans/big5.trans, big5-tbl.rb: new Chinese Big5 transcoding (from Yoshihiro Kambayashi)
fecce9e5e5293ebc8fbb161e8b5ecc3884a27d4e 2009-01-04 * test/ruby/test_transcode.rb: added tests for GB2312 (from Yoshihiro Kambayashi)
deeade6f3e249d92062d52a62a777bb495bd6e63 2009-01-04 * enc/trans/gbk.trans, gbk-tbl.rb: new Chinese GBK transcoding (from Yoshihiro Kambayashi)
c2cc750dd7ab1b9e4ef3c742f2f8e554736e5e58 2009-01-13 ext/socket/socket.c: removed warning about unused variable 'rai'.
654e94e1c976d675ac464e7edc1c1891a4a6b780 2009-01-13 Commit miss in previous commit.
82c673d3a1cab4a9f8a1e9ac30b28f108d726606 2009-01-14 * enc/trans/gb18030.trans, gb18030-tbl.rb: new Chinese GB18030 transcoding (from Yoshihiro Kambayashi)
0149e4515da3ef6508e9ecca7f422c029f1b9c22 2009-01-15 * LEGAL: Added information for missing/langinfo.c
0be9fb8caa6e511336dfb9f8348e55b20b1f75f3 2009-01-16 Commit miss at r21521, poited out by Yugui.
28862075842f6e8e62d00954a291b01ba80e61a1 2009-07-24 * enc/trans/big5.trans, big5-hkscs-tbl.rb: new Chinese BIG5-HKSCS transcoding (with Tatsuya Mizuno)
5e85648b770f6999bc0020ddddfe0bcdbf5149c5 2009-07-30 * transcode.c: added check for frozen string for encode! (see Bug #1836)
9d7e47114cca19a355f1410f2b7b48f150f0858c 2009-09-28 include/ruby/st.h: aligned prototype of st_hash_uint32 with function definition (fixing compiling problem on cygwin)
e0436c54c21343580d5fa6b9334fbfa20e10c646 2009-11-17 * enc/big5.c, enc/trans/big5.trans, enc/trans/big5-uao-tbl.rb, test/ruby/test-transcode.rb: Added Encoding 'Big5-UAO' and transcoding for it (from Tatsuya Mizuno) (see Bug #1784)
80b54214b288a7442d93a7b1e39311c3ce8b9f56 2009-11-25 * transcode.c: Added a check for an internal error (with Tatsuya Mizuno)
9998481d4efaf86dde652617ed3b2777924c3373 2009-12-03 * enc/trans/gb18030-tbl.rb: Fix omission of C1 region in code table (from Tatsuya Mizuno)
b32ee85f97597e1cb13b568dff15ee9a6d9de1ba 2009-12-10 * transcode_data.h, transcode.c, tool/transcode-tblgen.rb: Added support for new transcoding instruction FUNsio (with Tatsuya Mizuno)
5ff2c82a080c9677ea1e752117f2d7c2d5fea342 2010-11-29 test/ruby/test_transcode.rb (test_unicode_public_review_issue_121): - Removed commented-out options that are no longer under discussion. - Added two more tests for forthcomming clarifications.
0dfbb49a8350957c6af6f8fb43b297422df81095 2011-06-11 fixed a grammatical error
642562c707d3e53104f4088417761b783c744db5 2011-11-05 insns.def: Some fixes and tweaks to English explanations
ed73a902b80cec1bc42465fba4a07b8a6503e6ab 2011-11-30 transcode.c: Simplified rb_econv_binmode, avoided a warning on cygwin.
463633e4a934a00f869086a6ffbf84c6cb8ad630 2012-03-23 transcode.c (documentation for str_encode): Explain that transcoding to the same encoding is a no-op (i.e. no exceptions, no replacements,...). [ruby-core:43557][Bug #6190]
611159923088a098ee5903e7fb356ec7f12debd1 2012-04-06 struct.c (documentation for rb_struct_members_m): fix 'array of strings' to 'array of symbols' [ruby-core:44152][Bug #6264]
35020e355cdabd2edf1d9e264cd1ff12993fe027 2012-11-10 ruby.c: removed a comma before "before" (at Tamatsukuri Hot Spring (Onsen Hackathon))
e5ce9579ecb93316d71eacf6c16a187767f05f68 2013-04-23 lib/rexml/document.rb, lib/rexml/element.rb, lib/rexml/formatters/pretty.rb: remove opinionated language in documentation. [Bug #8309], reported by Charles Beckmann
d5f642ecc70c6bdacd227bc424d8f3bf07cc2417 2013-11-06 lib/open3.rb: tweaked grammar in comments
737c7d816bc5344206b917ef419efa83568a5175 2013-12-05 st.c: tweaked comment
96245d6dd37557acd9a066491db6c80b125f0676 2014-09-23 tool/downloader.rb: added Downloader.download_if_modified_since to reduce downloads of large files that change only rarely. [ruby-core:65164] [CommonRuby - Feature #10084]
9c0f09dae0a4370f61e4fd3448eeca689810e5cc 2014-09-24 tool/downloader.rb: Small fix to documentation comment.
d2172c8d190c60b4d218130df15a540ed7f72d35 2014-09-24 tool/downloader.rb: Made Unicode data file location available via :unicode Symbol.
d081032d6b49f1df9e32e892573b51968854d07c 2014-09-24 tool/downloader.rb: Fixing raise after return.
b6f2f67cc91ae10feeed4205c0f0be951859ece6 2014-09-24 tool/downloader.rb: Removing unused method Downloader.download_if_modified_since. (if ever used, just replace with Downloader.download)
f508efdb9b541893315cdc7d65978bc027bba601 2014-09-24 tool/downloader.rb: Adjusting example for Downloader.download to implementation changes in r47693.
ba63472a2f96fa5098bf1ff540693ad74e753e06 2014-09-24 enc/unicode/data: New directory for downloaded Unicode data files.
69cbd810faa7556dca3f1a9a32da752a34880102 2014-09-28 common.mk: Created new target update-unicode to download
550406c0167594f38ed9d061137fb59e38f714b6 2014-10-06 lib/unicode_normalize: New folder for Unicode normalization functionality
0fb67d59b2279540d99333ef1ef601e826fdf5d6 2014-10-06 tool/downloader.rb: Adjust example in documentation for Downloader.download.
33447b80d52f395b26c31a907648503129b1d077 2014-10-06 tool/unicode_norm_gen.rb: Data generation script imported from https://github.com/duerst/eprun/blob/master/lib/generate.rb
42b382841ce4b6d341d6b40b7fab75445f4319d6 2014-10-06 tool/unicode_norm_gen.rb: Adjusted directory paths.
daea1a5f3ae7a2faaab131f74c79146ff080523f 2014-10-06 tool/unicode_norm_gen.rb: Adding/tweaking comments.
6017de0314b5115183c04b81562bbe7fdb71959a 2014-10-19 lib/unicode_normalize/normalize.rb: Importing from https://github.com/duerst/eprun/blob/master/lib/normalize.rb.
4c769ce02149508d9bce84ce84739563175bf6b4 2014-10-19 lib/unicode_normalize/normalize.rb: Changed module name, adjusted copyright.
deeb1313bc7a823269d5ec2362b089445f8d6291 2014-10-19 tool/unicode_norm_gen.rb: Changed module name.
e4f71908d62d7e55a55888ad00dce25eb66a1dea 2014-10-19 lib/unicode_normalize.rb: Importing from https://github.com/duerst/eprun/blob/master/lib/string_normalize.rb. (removing trailing whitespace, fixing EOLs and adding EOL property)
b95ec57da2bfba783b299c593c5654e932c29992 2014-10-19 lib/unicode_normalize.rb: Changing require statement, adjusting copyright.
588bbfd1b4e68172fc3d205776e43664da46d8c0 2014-10-19 lib/unicode_normalize.rb: Changing module name.
dfb68b3d21901da1908a94a721e135b4d8fc97dc 2014-10-19 lib/unicode_normalize.rb: Changing method names, see https://bugs.ruby-lang.org/issues/10084#note-7
982f0de1418e185b3d9e3aaf31fc5a76ffcf941d 2014-10-19 tool/unicode_norm_gen.rb, lib/unicode_normalize.rb: File name change from lib/unicode_normalize/normalize_tables.rb to lib/unicode_normalize/tables.rb.
f83c0b0ae7e2d92e17d94c9de28375b091554351 2014-10-19 lib/unicode_normalize.rb: Added documentation.
281bfbd9cd5f72d08a4dd550f92cf51b8a6f656f 2014-10-19 lib/unicode_normalize.rb: Small documentation fix.
d72053c0a39447300706f1b64b6234292573383a 2014-10-19 lib/unicode_normalize.rb: Changed to dynamic loading of actual normalization code and tables.
25f3baf7b67ac17e8151f13418152d258bd1f609 2014-10-19 common.mk: Added a rule to generate lib/unicode_normalize/tables.rb. This rule still needs to be integrated into the overall make process.
8c722a9a1e26820fd466681b506fa0e23e6dddb1 2014-10-19 lib/unicode_normalize/normalize.rb: Added a missing file extension in require statement.
824670aa9e499edb17bce1f57e1b197968d2f380 2014-10-19 enc/prelude.rb: Added automatic loading of lib/unicode_normalize.rb. This makes sure that all the methods that are available on String are available without explicit require.
0fe805125287d40bb254a7bd8cb15940dc8ca5f4 2014-10-20 lib/unicode_normalize.rb: (unicode_normalize!): change method names in commented-out code. Followup to r48027.
ce351cf0a79ecbb1ff52eb23754b7e9e73a5fbea 2014-10-20 lib/unicode_normalize.rb: removing unnecessary 'self'.
acaafe2101be7bad03197075c7e1626640929c11 2014-10-20 lib/unicode_normalize.rb: revert r48046. The s in sIndex is not hungarian notation. The variable name sIndex is directly taken from the relevant part of the Unicode Standard, where it is written SIndex and stands for 'syllable index'. See pp. 144/145 of http://www.unicode.org/versions/Unicode7.0.0/ch03.pdf.
10cb4ca0c47d11e5ac32996e1efa4d7668659332 2014-10-20 tool/downloader.rb: Make sure we update to latest version if network connection is available, but we don't fail if there is no network connection but option -e is set and we already have a version of the file.
27cbf1f50d28f916d7a0157a2cac0d8e2acec1e2 2014-10-20 config.mk: Added missing data files as targets for prerequisite update_unicode.
7415796ca397ab293ded29122de30644ed437497 2014-10-21 lib/unicode_normalize/normalize.rb: Added comment to point to relevant portion of Unicode standard for Hangul (de)composition identifiers and algorithm.
5c27164d594997ae91442f46cbab4eace69a17f0 2014-10-21 lib/unicode_normalize/tables.rb: Committing to make version update easier and more predictable, and reducing compilation time.
69488371e4a9f93c09696ab7dc2126d62d44c7ed 2014-10-21 common.mk: Adding explicit creation of directory enc/unicode/data because git doesn't handle empty directories.
15bfe2793a849f500a15ae54553f4ffba5d55b41 2014-10-22 test/test-unicode_normalize.rb: Importing from https://github.com/duerst/eprun/blob/master/test/test_normalize.rb. (removing trailing whitespace, fixing EOLs and adding EOL property)
30b837b47c034ac01256936f8c09a49bba38e979 2014-10-22 test/test-unicode_normalize.rb: Removed explicit require, changed method names, adjusted copyright.
84f9fa87730bb45a9cabcc7a7907913d81ad7781 2014-10-22 test/test-unicode_normalize.rb: Adjusted path for test data file (now ../enc/unicode/data/NormalizationTest.txt).
71c534c962152295778655ea6bfb86331b970776 2014-10-22 tool/unicode_norm_gen.rb: Fixed escaping of backslash and double quote ('\\\&' -> "\\\\\\\&"; double quoted string is needed to make \& mean last match; double double backslashes are needed because of two layers of escaping).
0263002e2f6ec38bd89aebf2aa1edeada988e4c2 2014-10-22 common.mk: Fixed grammar in comment [ci skip]
df25ba40152c936d381a476f4c711f99f5033680 2014-10-23 NEWS: Added String#unicode_normalize(|!|d?) [ci skip]
c0988c87c42c43b82b7189b87d8c173ff32e78b9 2014-10-25 template/unicode_norm_gen.tmpl: Adjusted name of generating file.
b3494924b6b3aa1860b3d71b6e014e50be8ef0db 2014-10-25 test/test-unicode_normalize.rb: removed unused function.
bdea9273ca684065bd017e188a1ebc5b08c13ee7 2014-10-25 test/test-unicode_normalize.rb: added test_raise_exception_for_non_unicode_encoding.
4fda6198369cb7a83e35e6456a00365ee3095497 2014-10-25 lib/unicode_normalize/normalize.rb: added US_ASCII as trivially supported encoding (is always normalized, and may appear mixed in with UTF-8 or other Unicode encodings).
96d375540f390579fa93024819fb14ca05b9cd5e 2014-10-25 test/test-unicode_normalize.rb: added test_us_ascii.
d26c49657db909ccfa4434bdc975a3278db749fc 2014-10-26 string.c: improved comment.
68e19ae8559f37221a2524eca565aa476cb36f76 2014-10-26 template/insns.inc.tmpl, insns_info.inc.tmpl, known_errors.inc.tmpl, minsns.inc.tmpl: fixed path of generating script.
512c0364b3f526bef5bed1b224fbe6bb7f253eff 2014-10-26 commit miss
bd38e0c4f2a065ccf824ad41db0091e3290aa111 2014-10-30 tool/downloader.rb: changed Unicode data download location from latest Unicode version to Unicode 7.0.0.
2b7f0289f80be830cc15780e7211ddcec5a3f7ce 2014-11-05 lib/unicode_normalize/normalize.rb: Comment clarification. [ci skip]
62b511b6aa9e2c1ceb11048f012eed13e5feabfb 2014-11-09 lib/unicode_normalize/normalize.rb: Replaced if-else by case in self.normalized? in parallel to r48309.
24f5f816ff03621897a4c8b18393d3d027d1f02a 2014-12-02 array.c (rb_ary_plus): in documentation, added note about
b1cf08496bd4eed402abe084552bf7dcf5a8fe09 2014-12-30 lib/uri/common.rb: Initialize HTML5ASCIIINCOMPAT to empty Array to avoid error during bootstrap when encodings are not yet defined. [Bug #10678]
e3c4c7e13bcf5b202701ef5acc86b06187b5fdb7 2015-01-14 * lib/uri/mailto.rb: raising URI::InvalidComponentError instead of failing with undefined method `split' for nil:NilClass for mailto: URIs without opaque part. [Bug #10738] * test/uri/testuri.rb: Test for above
1fb502caca35b300fe6f852a89b4bef6bf73ad97 2015-09-29 tool/unicode_norm_gen.tmpl, lib/unicode_normalize/tables.rb: get rid of many .freeze commands by using frozen_string_literal pragma.
ae8c13f517eeae8234247aa244d0a271f819dd6a 2015-10-02 common.mk, lib/unicode_normalize/tables.rb: Change Unicode Version for character normalization data from 7.0.0 to 8.0.0.
927e18b3701662975182d1c0caeb9477af70817e 2015-10-24 vm_insnhelper.c: improved error message for "wrong number of arguments", distinguishing given and expected argument numbers clearly. [Feature #9025]
8e465f9c99a8a8db4ab36d21b87d29b9acf99fc0 2015-11-17 NEWS: Small grammatical fix
5d3cdd6556537bc4f80209d29c7dac66e8a7a0a5 2015-11-17 enc/unicode/casefold.h, name2ctype.h: Change Unicode Version for regular expressions from 7.0.0 to 8.0.0 (with help from Kimihito Matsui) [Feature #11563]
ec217619b6bc48650905055c365ce7207328cc24 2015-11-17 NEWS: Added update from Unicode 7.0.0 to 8.0.0 [ci skip]
dddbd864d1cb9119363caea98bb2ba8968d10828 2015-12-04 enc/iso_8859_4.c: Added missing lower/upper-case character pair (U+014A and U+014B, LATIN CAPITAL/SMALL LETTER ENG)
e1f69b39ae7b4dbd91525b38ba229bdc5d9a4fcf 2015-12-04 enc/iso_8859_13.c: Added three missing lower/upper-case character pairs (from Kimihito Matsui)
e924e657b6f9ab1bf359b56cb8137e6ab73e9b4f 2015-12-07 .gitignore: added cygwin*.def for Cygwin
d9c6135c5bf540055ec76e27fef3f4a66057cf70 2015-12-08 string.c: removed unused variable
373489bb4e292b2aea249ed6de71eba883ea1610 2015-12-13 KNOWNBUGS.rb: Fixed typo, made more explicit [ci skip]
83304b75c17b5126c8e8448f2cc08350146157d0 2015-12-14 * enc/ebcdic.h: new dummy encoding EBCDIC-US * enc/trans/ebcdic.trans: transcodings between EBCDIC-US and iso-8859-1 [with code from Andrea Ribuoli] * test/ruby/test_transcode.rb: tests for above * tool/transcode_tablegen.rb: additional argument for method transcode_tblgen
f262863188e86756c63e42fdb129dfad6fe98db8 2015-12-15 tool/transcode_tablegen.rb: detailled documentation for transcode_tblgen function [ci skip]
81e7be08ed1feaa3be683aeaa5fdbc79ed0489eb 2015-12-15 enc/ebcdic.h, enc/trans/ebcdic.trans, test/ruby/test_transcode.rb: Fixed encoding name to the correct one in the IANA registry (IBM037) and added an alias (ebcdic-cp-us)
7274abb7b0e159b3f5d4b9046a0d33b60ca928aa 2015-12-15 NEWS: added news about EBCDIC encoding
bf6dd5c2f761e9d9edbcba370f6d0503006cbc76 2015-12-15 fixing ChangeLog problem
1db69bf3ef69445fcaa6bd1dd9e9ec9e8ff0632a 2015-12-15 reverting accidental commit at r53124 by re-committing version from r52612
6973c8d51e689fcd5fc44647b7352b83168c4099 2015-12-19 enc/iso_8859_2.c, enc/windows_1250.c: separate Windows-1250 from ISO-8859-2 to fix 0x80..0x9e range (from Kimihito Matsui)
0ab9e706258119caf3af0de1b0e94b86d2567931 2015-12-25 doc/CangeLog-2.3.0, ext/tk/lib/txextlib/SUPPORT_STATUS, include/ruby/version.h: minor grammar fixes [ci skip]
eb86cd6b7b8f6bbbe03a1667113c4755e68f8290 2016-01-11 include/ruby/oniguruma.h: Added flags needed for upcase/downcase Unicode addition (with Kimihito Matsui)
2788cd98491e4fd53881c855f1361d5722fd012c 2016-01-12 string.c: Added option parsing/checking for upcase/downcase/ capitalize/swapcase (with Kimihito Matsui
92dc9e55be9ee3faeedb776f2c194703d0a54a9b 2016-01-12 test/ruby/enc/test_casing_options.rb: Tests for option parsing/checking for upcase/downcase/capitalize/swapcase (see r53503; with Kimihito Matsui)
f23658f1c10b33b51805a9286f1dcdf0ab95b63f 2016-01-12 string.c: minor grammar fix [ci skip]
4a5d3572e62ba11f55c704eb391aec6710205a28 2016-01-12 string.c: made a variable name more grammatically correct
be897c2507a9d7710f218ccf377e6ea67d6d47bf 2016-01-16 * string.c, enc/unicode.c: New code path as a preparation for Unicode-wide case mapping. The code path is currently guarded by the :lithuanian option to avoid accidental problems in daily use. * test/ruby/enc/test_case_mapping.rb: Test for above. * string.c: function 'check_case_options': fixed logical errors
c12af76763a1bff53ed77bc4d236f441d8679880 2016-01-16 * enc/unicode.c: Artificial mapping to test buffer expansion code. * string.c: Fixed buffer expansion logic. * test/ruby/enc/test_case_mapping.rb: Tests for above. (with Kimihito Matsui)
959bbb6f7202676f2da1ef5e134e6152e8613b54 2016-01-17 * enc/unicode.c: Removed artificial expansion for Turkic, added hand-coded support for Turkic, fixed logic for swapcase. * string.c: Made use of new case mapping code possible from upcase, capitalize, and swapcase (with :lithuanian as a guard). * test/ruby/enc/test_case_mapping.rb: Adjusted for above. (with Kimihito Matsui)
f307d1fe21371ee27436da79d240ad9ade285257 2016-01-17 * enc/unicode.c: Fixed a logical error and some comments. * test/ruby/enc/test_case_mapping.rb: Made tests more general. (with Kimihito Matsui)
e580847ce8b54899d447c8561334cceccc20f179 2016-01-17 * string.c: Any kind of option is now taking the new code path for upcase/downcase/capitalize/swapcase. :lithuanian can be used for testing if no specific option is desired. * test/ruby/enc/test_case_mapping.rb: Adjusted to above. (with Kimihito Matsui)
3dcdb2c3b72cd4200672a32e23c82bd1d499183a 2016-01-19 * common.mk: Added Unicode data file CaseFolding.txt to be additionally downloaded (with Kimihito Matsui)
39988ff0ac3f091c7faaf160f7a90c66c6e90ba9 2016-01-23 * tool/downloader.rb: Fixed a logical error, improved documentation
0d3d8a8cda9436627573b1ad2160a30b17b679fd 2016-01-23 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@53633 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
387e838c024cbe59c5df297d8ad36012406e38d1 2016-01-23 ChangeLog: Fixing wrong time on previous commit, and adding previous commit message to svn [ci skip]
615d99b7e7cd2865a5b70d0ebd84f0eb890a8612 2016-01-24 * common.mk: Simplifying Unicode data file download logic to make it more reliable (including additional fix not in r53633) [Bug #12007]
415949faba4dc741bd03e0ab34fdcfeb894b8b61 2016-01-27 * enc/unicode.c: Protect code point count by macro, in order to be able to use the remaining bits for flags. (with Kimihito Matsui)
a7c987968de91bcc4cada65ce90b23c2f89957e3 2016-01-27 * enc/unicode.c: Fixed bit mask in macro OnigCodePointCount
6ec7e30ce06a7c6fd18c92e84caf2abe1d92b4ef 2016-02-04 * common.mk: Introduce two variables (UNICODE_DATA_DIR and UNICODE_SRC_DATA_DIR) to eliminate repetitions.
bb4f87127f370fc51496f51d5cae55103ddebfd1 2016-02-04 * common.mk: Added Unicode data file SpecialCasing.txt to be additionally downloaded (with Kimihito Matsui)
b658249cef3c0032be91e499dfc04cd00642d354 2016-02-04 * enc/unicode.c: Activated :ascii flag for ASCII-only case conversion (with Kimihito Matsui)
6ab70ad72aed0909868f59135f6f5d1e49877323 2016-02-05 * test/ruby/enc/test_case_mapping.rb: added tests for :ascii option. (with Kimihito Matsui)
81515b2381dcd325ca57c0272a551bba4f112afd 2016-02-06 * enc/unicode.c, test/ruby/enc/test_case_mapping.rb: Implemented :fold option for String#downcase by using case folding data from regular expression engine, and added a few simple tests. (with Kimihito Matsui)
b217dc81f3e08207a381b9c7cdb86d4909c579e6 2016-02-06 * test/ruby/enc/test_regex_casefold.rb: Tests for three case folding primitives (mbc_case_fold, get_case_fold_codes_by_str, apply_all_case_fold) in the various encodings. Currently only known good encodings are tested to avoid test failures. For bug hunting, start by adding more encodings with generate_test_casefold encoding (with Kimihito Matsui)
e8dde46b6064c07e2b11a42046e2948bcbc2dd06 2016-02-06 * test/ruby/enc/test_regex_casefold.rb: Added data-based testing for String#downcase :fold. * enc/unicode.c: Fixed a range error (lowest non-ASCII character affected by case operations is U+00B5, MICRO SIGN) * test/ruby/enc/test_case_mapping.rb: Explicit test for case folding of MICRO SIGN to Greek mu. (with Kimihito Matsui)
07ad033850328cf2f19a9e1521eb631b7059b5ed 2016-02-07 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@53759 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
1e7770d372486af5b67e6964e1ce9139d651cf46 2016-02-07 (this commit message applies to the previous commit) * common.mk: explicit recipe for enc/unicode/casefold.h * enc/unicode/case-folding.rb: Adding -m option to prepare for using multiple data files. (with Kimihito Matsui)
ee696372a8840135c06e5605cf90593ff672b522 2016-02-07 * common.mk: using new option in recipe for enc/unicode/casefold.h * enc/unicode/case-folding.rb: Correctly specify argument to new option. (with Kimihito Matsui)
5cbe9fe5c4c1a8f9a077663cd3e754dc89b39b59 2016-02-07 * enc/unicode/case-folding.rb: New classes CaseMapping/CaseMappingDummy to pass as parameters; not yet implemented or used. (with Kimihito Matsui)
e40d1173397becd705ee55fddde51432ec4638a2 2016-02-07 * enc/unicode/case-folding.rb: Fixing parameter passing. (with Kimihito Matsui)
49ca434bcf64f115bf1915f5287fbb61598ed773 2016-02-07 * common.mk: Added two more precondition files for enc/unicode/casefold.h * enc/unicode.c: Added shortening macros for enc/unicode/casefold.h * enc/unicode/case-folding.rb: Fixed file encoding for CaseFolding.txt to ASCII-8BIT (should fix some ci errors). Clarified usage. Created class MapItem. Partially implemented class CaseMapping. (with Kimihito Matsui)
c1db3a49a4c13370e7d814a1847e4fa54355d332 2016-02-08 * common.mk: Removed enc/unicode/casefold.h from automatic build because some CI systems don't have gperf. Creation of enc/unicode/casefold.h is now possible with make unicode-up. This is intended as a temporary measure.
8f10a72d909bea83bca029375516dbcb02286518 2016-02-08 * enc/unicode.c: Shortened macros for enc/unicode/casefold.h to single-letter; use flags in casefold.h for logic. * enc/unicode/case-folding.rb: Added flag for case folding. Changed parameter passing. * enc/unicode/casefold.h: New flags added. (with Kimihito Matsui)
2ca7569c6d2a833977b2bc55329a7c49d87e21df 2016-02-08 * string.c, enc/unicode.c: Disassociating ONIGENC_CASE_FOLD flag from ONIGENC_CASE_DOWNCASE. (with Kimihito Matsui)
73ab88994f4ce23e65c4ebedb87fd687281e54d2 2016-02-08 * enc/unicode/case-folding.rb, enc/unicode/casefold.h: Flags for upper/lower conversion added (titlecase and SpecialCasing still missing) (with Kimihito Matsui)
c3554cdea6a1a4993034e1846546d9cb4c0a933d 2016-02-15 * enc/unicode/case-folding.rb: Added debugging option (with Kimihito Matsui)
6a808bda64f4a6a2e978d71ae0a957906dcb2dd5 2016-02-16 * enc/unicode/case-folding.rb, casefold.h: Used only first element (rather than all) of target in CaseUnfold_11 array. (with Kimihito Matsui)
6286ff6301f94a475702d905cb722d3a4c7c4535 2016-02-19 * enc/unicode.c: Activated use of case mapping data in CaseUnfold_11 array. (with Kimihito Matsui)
5470ce82067948f6d6a18a7805de3c64a2666ddf 2016-02-22 * enc/unicode/case-folding.rb, casefold.h: Fixed bug that avoided inclusion of compatibility characters in uppper-/lower-case mappings. (with Kimihito Matsui)
4ca9138bac7fd93501581f65d73435eed59a3087 2016-02-22 * enc/unicode/case-folding.rb, casefold.h: Adding flag for title-case, not yet operational. (with Kimihito Matsui)
8aa8847b7c69bd9de812f4fefa39c9c4c4cb7b21 2016-02-23 * enc/unicode/case-folding.rb, casefold.h: Reading casing data from SpecialCasing.txt. (with Kimihito Matsui)
1cc579cb0036c4178f4279b9d505d0f6ba5d375a 2016-02-23 * enc/unicode/case-folding.rb, casefold.h: Outputting actual titlecase data (new table, with indices from other tables). * enc/unicode.c: Ignoring titlecase data indices for the moment. (with Kimihito Matsui)
f1f48e610311d812eaf408d0947e8a3686ed3364 2016-02-24 * include/ruby/oniguruma.h: Rearranging flag assignments and making space for titlecase indices; adding additional macros to add or extract titlecase index; adding comments for better documentation. * enc/unicode.c: Moving some macros to include/ruby/oniguruma.h; activating use of titlecase indices. (with Kimihito Matsui)
7feb182a08fb3f2a3a9e693f20947818de9a38cc 2016-02-25 * enc/unicode/case-folding.rb: Adding possibility for debugging output for TitleCase table in casefold.h. (with Kimihito Matsui)
c4e6964141ef8ec7f42cc8ded29434b3a235c0ae 2016-02-27 * enc/unicode/case-folding.rb, casefold.h: Reducing size of TitleCase table by eliminating duplicates. (with Kimihito Matsui)
59766643db17f8dbfe518cafa20f6ba36a9b8d9b 2016-03-11 * enc/unicode/case-folding.rb, casefold.h: Streamlining approach to case mapping data not available from case folding by unifying all three cases (special title, special upper, special lower). * enc/unicode.c: Adjust macro names for above (macros are currently inactive). (with Kimihito Matsui)
4b15b54d680bbb6e97673e67085ed4f70b612ccb 2016-03-14 * include/ruby/oniguruma.h, enc/unicode.c: Adjusting flag assignments and macros to work with unified CaseMappingSpecials array. (with Kimihito Matsui)
00cc59a0547a953cbf162f24585c8e6edaa58df3 2016-03-15 * enc/unicode.c: Additional macros and code to use mapping data in CaseMappingSpecials array. (with Kimihito Matsui)
8679f113e949e2484f105fa27a1334342c244b3c 2016-03-15 * enc/unicode.c: Expansion of some code repetition in preparation for elimination of common code pieces. (with Kimihito Matsui)
e89232eb15857daae7bc8cbeec98c253a6e8a9f2 2016-03-15 * enc/unicode.c: Eliminating common code. (with Kimihito Matsui)
0052ed9d806f3739df9200891a376a18dc8068f9 2016-03-16 * test/ruby/enc/test_case_mapping.rb: Adding tests for actual Unicode case mapping. Fixing some aliasing issues. (with Kimihito Matsui)
417f07d996262ae00ebd59d32ed025929b318d2b 2016-03-16 * test/ruby/enc/test_case_mapping.rb: Adding tests for Cherokee. One test not yet working. (with Kimihito Matsui)
bbef7b401f4d738b94892d0d80ba43832871f941 2016-03-16 * test/ruby/enc/test_case_mapping.rb: Fixed a logical error.
50591a768de9e2381789a8aeed1bc2c654c18e83 2016-03-16 * test/ruby/enc/test_case_mapping.rb: Fixed and activated a test for Cherokee.
fdbb82967f54491b3b209af8f2d4aeedfb5ea214 2016-03-17 * enc/unicode.c: Fixed two macro definitions. * test/ruby/enc/test_case_mapping.rb: Test cases that detected the above bugs.
2f455ceca40dcddb0f92c2918796673187cd8ed4 2016-03-22 * include/ruby/oniguruma.h: Additional flag for characters that are titlecase. * enc/unicode/case-folding.rb, casefold.h: Using above flag in data. * enc/unicode.c: Marking capitalized character as unmodified if it is already titlecase. * test/ruby/enc/test_case_mapping.rb: Tests for above functionality.
8006157cbadb382b1b37e2fbd6a9e3092814db5d 2016-03-25 * test/ruby/enc/test_case_mapping.rb: Additional tests title case; some not yet activated.
663fb4dd44cb55388ef647f533cdd2cd578319f3 2016-03-28 * enc/unicode.c: Adding comments. [ci skip]
1582093c776af309314c4cf1598e81af46bf2117 2016-03-28 * enc/unicode.c: Removed code now covered by data from table.
890ce36b797fe3e91d039f5fafcba3681f342557 2016-03-28 * enc/unicode.c: Minor refactoring for I WITH DOT ABOVE.
2d20a27fb4ee8d3401627e538cb68019704d7bae 2016-03-28 * enc/unicode.c: Refactoring in preparation for data reduction for titlecase.
0e6f8b166d6155503bf490a9853c0f0c2fbd135b 2016-03-29 * enc/unicode/case-folding.rb, casefold.h: Removing data for idempotent titlecasing. * enc/unicode.c: Adjust code to data removal.
49f25a1299e04d3423351ab9bc212b8cdd6547a3 2016-03-29 * enc/unicode.c: Cleaned up some comments.
78f540019a394421e1875cacaf956e8c23b18cc0 2016-03-29 * enc/unicode/case-folding.rb, casefold.h: Tweaked handling of 6 special cases in CaseUnfold_11_Table. * enc/unicode.c: Adjustments for above. * test/ruby/enc/test_case_mapping.rb: Tests for the above: Some tests in test_titlecase activated; test_greek added. A test in test_cherokee fixed.
5e9d33ad49079877ee9461e0449150a28bf0902d 2016-04-01 * enc/unicode/case-folding.rb, casefold.h: Data generation to implement swapcase functionality for titlecase characters. Swapcase isn't defined by Unicode, because the purpose/usage of swapcase is unclear anyway. The implementation follows a proposal from Nobu, swaping the case of each component of a titlecase character individually. This means that the titlecase characters have to be decomposed. * enc/unicode.c: Code using the above data. * test/ruby/enc/test_case_mapping.rb: Tests for the above.
65db16de9f7f9fb806dea4bf072e5585cdd42ee2 2016-05-16 * include/ruby/oniguruma.h: Introducing ONIG_CASE_MAPPING compilation switch * include/ruby/oniguruma.h, enc/unicode.h: Using ONIG_CASE_MAPPING compilation switch
84cd51919bf35676d1ee45a75f969fba3c8fac1f 2016-05-16 * enc/unicode.h: Additional uses of ONIG_CASE_MAPPING compilation switch
14b524b6b01bde70f207a049bd78899eb88823b8 2016-05-22 regenc.h/c: Define new function onigenc_not_support_case_map
78c5ca70744e85f8468147ddc362962bcd987d30 2016-05-22 * include/ruby/oniguruma.h: Extend OnigEncodingTypeDefine to define a new encoding primitive 'case_map' for case mapping * enc/utf-8.c, utf_16be/le.c, utf_32be/le.c: add onigenc_unicode_case_map as case_map primitive * enc/ascii.c, big5.c, cp949.c, emacs_mule.c, euc_jp/kr/tw.c, gb18030.c, gbk.c, iso_8859_1/2/3/4/5/6/7/8/9/10/11/13/14/15/16.c, koi8_r/u.c, shift_jis.c, us_ascii.c, windows_1250/1251/1252.c: add onigenc_not_support_case_map as case_map primitive
de305716c2166156e96e361962fe4042ff95e548 2016-05-22 * rename test/ruby/enc/test_casing_options.rb to test_case_options.rb for consistency
0bf2e5c70bd153db6dd0d49552ad4db1a18a6097 2016-05-22 * test/ruby/enc/test_case_options.rb: adjust test class name to match file name
ef6405f71cbc73864a4ee4a46c9ab6d5ccd16959 2016-05-24 * enc/unicode.c: Fix flag error for switch from titlecase to lowercase. * test/ruby/enc/test_case_mapping.rb: Tests for above error.
46647ac8df475c062f83f271efa53c544d22d1ba 2016-05-25 * enc/unicode.c: Handle DOTLESS_i by hand because it isn't involved in folding.
c1fedc8de74542fb6ba06b4e1348889780dde7c2 2016-05-26 * test/ruby/enc/test_case_comprehensive.rb: Add set of comprehensive (across most Unicode characters; later across most character encodings) tests for case mapping.
6375ad2815f46828888cea35f71651a844c74e9e 2016-05-28 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55187 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
288878041f3ef5faed688a90782e5dea62a10cb6 2016-05-28 * test/ruby/enc/test_case_comprehensive.rb: Converted exception for unavailable Unicode data files to failed assertion.
62efd110f2a5a5d193a036917a256bfb516ba7e4 2016-05-28 * test/ruby/enc/test_case_comprehensive.rb: Add error messages to tests for data file availability; refactoring.
3883bb6baeabd8c60e9c01c43714cef3a99fa87a 2016-05-28 * test/test_unicode_normalize.rb: Add test to check for availability of Unicode data file; refactoring; fix an error with tests for destructive method (unicode_normalize!).
85950c52570e9d79e9108918db5d37abd9e16a32 2016-05-30 * string.c: Document current situation for String#downcase. [ci skip]
ae4fba31672956462d8d88c9e473f9cbf30e2f2a 2016-05-30 * string.c: Document current behavior for other case mapping methods on String. [ci skip]
3fb0fcd1e881c1f6dd74db73a64e8623208acb77 2016-05-31 * string.c: Activate full Unicode case mapping for UTF-8 by removing the protective check for the presence of an option. Update documentation. * test/ruby/enc/test_case_comprehensive.rb: Adjust tests for above change.
3dd98b2446271f8ad2837bc4b63c459c131c579f 2016-06-02 * string.c: Raise ArgumentError when invalid string is detected in case mapping methods. * enc/unicode.c: Check for invalid string and signal with negative length value. * test/ruby/enc/test_case_mapping.rb: Add tests for above. * test/ruby/test_m17n_comb.rb: Add a message to clarify test failure.
c5ea268264b006dbdeebbf66c847e4e3c778310b 2016-06-05 * string.c: Remove :lithuanian guard for Unicode case mapping.
2cd799b9676584b0b69c351285014477d1edc501 2016-06-05 * test/ruby/enc/test_case_comprehensive.rb: Remove :lithuanian guard for Unicode case mapping.
8ba27d7312f811930d9676e4c5c574e411b00ea7 2016-06-05 * NEWS: Add news about Unicode-wide case mapping for String/Symbol#upcase/downcase/swapcase/capitalize(!).
b09795d803b41b067c86a100072facb1ba98cfe3 2016-06-06 * test/ruby/enc/test_case_mapping.rb: Remove :lithuanian guard for Unicode case mapping.
2f49aa8f62c5e1c99dabf453fc162129430ebd95 2016-06-06 * string.c Change rb_str_casemap to use encoding primitive case_map instead of directly calling onigenc_unicode_case_map.
a4ccbb63cdeb0bc42d8563011d4884510715867a 2016-06-06 * test/ruby/enc/test_case_comprehensive.rb: Speed up testing for small encodings by preselecting codepoints.
ab5f23f26c844aa89fb380191e8c70062cab60fa 2016-06-06 * string.c: Added UTF-16BE/LE and UTF-32BE/LE to supported encodings for Unicode case mapping. * test/ruby/enc/test_case_comprehensive.rb: Tests for above functionality; fixed an encoding issue in assertion error message.
c8c9eff25c8d6205ddd34a2bc7969d7967354aec 2016-06-07 * regenc.c (onigenc_not_support_case_map): Rewrite to work correctly in ASCII range.
61b3be7dbbd3090df355e4a23e5d73caec0bdc82 2016-06-07 * regenc.c (onigenc_not_support_case_map): Move to end of file; (onigenc_single_byte_ascii_only_case_map): Add new function.
fd7925ffa547d36de6d24f3500593736226b89a6 2016-06-07 * regenc.h/c: Rename onigenc_not_support_case_map to onigenc_ascii_only_case_map. * regenc.h: Add definition of onigenc_single_byte_ascii_only_case_map. * enc/iso_8859_X.c, windows_125X.c, ascii.c, us-ascii.c, koi8_x.c: Replace onigenc_not_support_case_map by onigenc_single_byte_ascii_only_case_map. * enc/big5.c, cp949.c, emacs_mule.c, euc_X.c, gbX.c, shift_jis.c, windows_31j.c: Replace onigenc_not_support_case_map by onigenc_ascii_only_case_map.
b5baea5f87801e5c775c528e28dfc45ef856fcea 2016-06-07 * test/ruby/enc/test_case_comprehensive: Fix method name (generate_casefold_tests -> generate_case_mapping_tests).
1b98612090429966ab2d39413865cd44d4dbfff0 2016-06-07 * test/ruby/enc/test_case_comprehensive: Add regression tests for current ASCII-only case conversion.
53a3e3ddd9e5fc88b9dfec30825cbc9e4e239cd4 2016-06-07 * string.c (rb_str_downcase_bang): Switch to use primitive except if conversion can be done ASCII-only.
8743f010c6f60cd1ac807f30f3e06fa57000ba25 2016-06-07 * string.c (rb_str_upcase_bang, rb_str_capitalize_bang, rb_str_swapcase_bang): Switch to use primitive.
768b13023814b180724023246c65d3942ff76375 2016-06-07 * test/ruby/enc/test_case_comprehensive: Change test for encodings without any non-ASCII case conversions from ASCII-only test to full test.
f0fc6ec872f043836f01455b16feee6bb9ed9eb9 2016-06-08 * string.c: New static function rb_str_ascii_casemap; special-casing :ascii option in rb_str_upcase_bang and rb_str_downcase_bang. * regenc.c: Fix a bug (wrong use of unnecessary slack at end of string). * regenc.h -> include/ruby/oniguruma.h: Move declaration of onigenc_ascii_only_case_map so that it is visible in string.c.
5eb73eeda81b63d3d9d219b36bfd98dc27671a13 2016-06-08 * string.c: Special-case :ascii option in rb_str_upcase_bang.
872f9a498fb5ec52eb54b8ee1f546b0d50817779 2016-06-08 * string.c: Revert previous commit (possibility of endless loop).
13f576d6b9f76c6ab2e6762dc78697309dac3708 2016-06-10 * string.c: Special-case :ascii option in rb_str_upcase_bang (retry).
10174c295b91ef1370d3c910755d79c895d0ef32 2016-06-10 * string.c: Special-case :ascii option in rb_str_capitalize_bang and rb_str_swapcase_bang.
02f7ad6237466d552c21f1cc78acd33febaa5ee6 2016-06-11 * enc/iso_8859_1.c: Implement non-ASCII case mapping. * test/ruby/enc/test_case_comprehensive.rb: Tests for above. * string.c: Add iso-8859-1 to supported encodings.
9fa8b80550d2af9309d2f493138c21969711bf60 2016-06-11 * enc/iso_8859_1.c: Revert to older version of code.
60095cecc70850382b082e423f1740d59ca78798 2016-06-12 * test/ruby/enc/test_case_comprehensive.rb: Add tests for ASCII-only swapcase; store calculated values in hashes.
86e9f85553c1e0e06a0196ba84c901bb5d099d6b 2016-06-13 * test/ruby/enc/test_case_comprehensive.rb: Add tests for full Unicode swapcase.
d282efbeff498f2abaeb3f748a812251e76e39c4 2016-06-19 * localeinit.c: Fix filesystem encoding for cygwin to UTF-8 (see https://cygwin.com/cygwin-ug-net/using-specialnames.html#pathnames-unusual)
b67ead14521fb74bcf8ec28f8c78245dfb536b70 2016-06-19 * test/ruby/test_dir_m17n.rb: Skip tests with non-UTF-8 encodings on cygwin. Cygwin can use the Unicode PUA (private use area) to store bytes from non-UTF-8 filenames (see https://cygwin.com/cygwin-ug-net/using-specialnames.html#pathnames-specialchars), but we are not supporting this. [Bug #12443]
fda9ce3edd084c5011fb612a866d015c02fe8513 2016-06-20 * test/ruby/test_in.rb: Skip test_open_fifo_does_not_block_other_threads on cygwin. Fifos seem to work okay in cygwin, but this test repeatedly hangs.
fe875c6e28dfc20f489bdf3b06b270135f4e7b17 2016-06-21 * LEGAL: Added filenames and copyrights for some files in enc/trans/JIS.
8ee8a6f93c0e4f6b62f00e4cb181faf21e81d2f9 2016-06-21 * test/ruby/test_in.rb: Skip test_read_buffer_error on cygwin, because this test repeatedly hangs.
0f8a24a706e1e598a0e4bfaff3df0e0f8fc88d8c 2016-06-21 * test/ruby/test_io.rb: Skip test_readpartial_with_not_empty_buffer, test_read_buffer_error, test_read_unlocktmp_ensure, test_readpartial_unlocktmp_ensure, and test_sysread_unlocktmp_ensure on cygwin, because these tests repeatedly hang. This makes test_io.rb complete in finite time on cygwin. * ChangeLog: Fix test_in.rb -> test_io.rb (two instances).
0546930ecbce703727f1b036441bacb7e0a43799 2016-06-21 * test/ruby/enc/test_regex_casefold.rb: Add Windows-1251, KOI8-R, and KOI8-U to encodings; definitely removed EUC-JP.
4a41a0f7d6bf01c491bd1031d7647b029e328b59 2016-07-03 * test/ruby/enc/test_case_comprehensive.rb, test_regex_casefold.rb, test/test_unicode_normalize.rb: Replace UNICODE_VERSION from UnicodeNormalize with RbConfig::CONFIG['UNICODE_VERSION'] from feature 12460.
79b1758b65563815abb5d626a21b597e39fcfea4 2016-07-06 * enc/iso_8859_16.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-16, by Satoshi Kayama.
6fb97b3f32b491d3f15108f23f90f16f6034248e 2016-07-06 * enc/iso_8859_15.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-15, by Maho Harada.
9511067652171d409936148560365d389e3d17b4 2016-07-06 * enc/iso_8859_1.c, enc/iso_8859_15.c, enc/iso_8859_16.c: Allign indenting to onigmo convention.
1edb021b5ced5c8cff609c1a6181825b829f053b 2016-07-06 * enc/iso_8859_14.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-14, by Yutaro Tada.
0657672a0cecf2c47f988b11fd870fb27d58b266 2016-07-10 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55622 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
6b9e4b81f721bf6fffa8c32f11066bfb1fb6f5b6 2016-07-10 * test/ruby/enc/test_case_comprehensive.rb: Fixed a comment (message belongs to last commit). [ci skip]
3a2553c35672aae9049ec241c8564f9b91c22c79 2016-07-10 * enc/iso_8859_4.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-4, by Kotaro Yoshida.
8f0b58d36addb9466f58e692df69e568c7fdd72a 2016-07-10 * test/ruby/enc/test_case_comprehensive.rb: Changed testing logic in to catch unintended modifications of characters that do not have a case equivalent in the respective encoding. * enc/iso_8859_1.c, enc/iso_8859_15.c: Fixed unintended modifications of micro sign and y with diaeresis.
07ac66ccecb73c1001c822dfa17fc912273af93c 2016-07-10 * enc/iso_8859_10.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-10, by Toya Hosokawa.
b5d869a89d9d20283ec0034270e9c9056a4d07f3 2016-07-11 * enc/iso_8859_4.c, enc/iso_8859_10.c, enc/iso_8859_14.c, enc/iso_8859_15.c, enc/iso_8859_16.c: Replace case-by-case code with lookup in ENC_ISO_8859_xx_TO_LOWER_CASE table.
7253570a838802cb04e99206beb945c427502616 2016-07-11 * enc/iso_8859_1.c: Moved test for lowercase characters without uppercase equivalent.
7b66f0bae9ee8f41b8765b175c9ccbb825a294cf 2016-07-12 * enc/iso_8859_3.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-3, by Takuya Miyamoto. * test/ruby/enc/test_case_comprehensive.rb: Extend special treatment for Turkic.
7c0cb4351a4ec726a7c8fad94c61d1781125ca86 2016-07-12 * revert r55642 (previous commit) because of test failure at https://travis-ci.org/ruby/ruby/builds/144148780
0e292561756eae33f73d6ab93ee3ddb0aeab3dd2 2016-07-12 * test/ruby/enc/test_iso_8859.rb: Excluded dotless i/I with dot from case-insensitive matching because they are not a case pair.
19b5e818dd7191bd4a04d57582aa92e774f3f053 2016-07-13 * enc/iso_8859_3.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-3, by Takuya Miyamoto. * test/ruby/enc/test_case_comprehensive.rb: Extend special treatment for Turkic. * enc/iso_8859_3.c: Exclude dotless i/I with dot from case-insensitive matching because they are not a case pair.
0f3d197da100e11b4897cf02510f867f921c33d8 2016-07-13 * enc/iso_8859_13.c, enc/windows_1257.c: Split Windows-1257 from ISO-8859-13 to be able to implement different case conversions.
336b6b198003408e1d8f058e503463b87359d329 2016-07-13 * enc/iso_8859_13.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-13, by Kanon Shindo.
cbc947885a8d4962721936f703f386d25ae5f12d 2016-07-13 * enc/iso_8859_7.c, enc/windows_1253.c: Split Windows-1253 from ISO-8859-7 to be able to implement different case conversions.
93c1109c19e1475a77c0d68d727423b38acb74d7 2016-07-13 * enc/iso_8859_9.c, enc/windows_1254.c: Split Windows-1254 from ISO-8859-9 to be able to implement different case conversions.
c5682ac4904e5a98d8121e88c8e48426b422666c 2016-07-13 * enc/windows_1254.c: Adjust variable/macro names.
e3600eaca1b74ffc59ac4821a05b48055ec5d0b5 2016-07-13 * enc/iso_8859_5.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-5, by Masaru Onodera. * test/ruby/enc/test_case_comprehensive.rb: Fix order of encodings.
b9cd6920d21acc62fb587663070323bdb3993205 2016-07-13 * enc/iso_8859_1.c, enc/iso_8859_4.c: Avoid setting modification flag if there is no modification.
6a52a5488a7d77c304dbd87e2f6f9176e7c47023 2016-07-13 * enc/iso_8859_7.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-7, by Kosuke Kurihara. * test/ruby/enc/test_case_comprehensive.rb: Fix order of encodings.
9f74ae4cf5ae8a53d0c00532f27d96e9c981f8ff 2016-07-13 * enc/windows_1252.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for Windows-1252, by Serina Tai. * test/ruby/enc/test_case_comprehensive.rb: Fix order of encodings.
2ac58e689131e368a0148b05856054c0e04d7409 2016-07-13 * enc/iso_8859_9.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-9, by Kazuki Iijima. * enc/iso_8859_9.c: Exclude dotless i/I with dot from case-insensitive matching because they are not a case pair. * test/ruby/enc/test_iso_8859.rb: Make test coverage for ISO-8859-9 a bit more complete.
306b64bd1a05473319ca57e543e53dc56dca4c1c 2016-07-19 * lib/unicode_normalize/tables.rb: Remove UnicodeNormalize::UNICODE_VERSION (#12546).
0ad3557d81dce743dae72944c26a659829137e50 2016-07-22 * template/unicode_norm_gen.tmpl: Remove UnicodeNormalize::UNICODE_VERSION at origin [Feature #12546].
c6692d9410ab5b20ed3fa84db72d51d9a37a6179 2016-07-22 * string.c (String#dump): Change escaping of non-ASCII characters in UTF-8 to use upper-case four-digit hexadecimal escapes without braces where possible [Feature #12419]. * test/ruby/test_string.rb (test_dump): Add tests for above.
6ed393ad89eb821d838d70cf13040d3f39237554 2016-07-24 * regenc.h/c, include/ruby/oniguruma.h, enc/ascii.c, big5.c, cp949.c, emacs_mule.c, euc_jp.c, euc_kr.c, euc_tw.c, gb18030.c, gbk.c, iso_8859_1|2|3|4|5|6|7|8|9|10|11|13|14|15|16.c, koi8_r.c, koi8_u.c, shift_jis.c, unicode.c, us_ascii.c, utf_16|32be|le.c, utf_8.c, windows_1250|51|52|53|54|57.c, windows_31j.c, unicode.c: Remove conditional compilation macro ONIG_CASE_MAPPING. [Feature #12386].
c2b500a000a8128cd1886c6c7b0f7f19eed9fce0 2016-07-26 * test/ruby/enc/test_case_comprehensive.rb: Add explicit skip test for availability of Unicode data files.
c8a1d8b33bc612e6c58e3fbeb3226375bc4f4184 2016-07-26 * enc/windows_1251.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for Windows-1251, by Shunsuke Sato.
aec1ac6e5127c19d8db89f464e42cca439333974 2016-07-26 * enc/windows_1251.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for Windows-1251, by Shunsuke Sato.
14dd8a17e84e123299dab248ef4060d817e0d7d6 2016-07-26 * enc/windows_1250.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for Windows-1250, by Sho Koike. * ChangeLog: Fixed order of previous two entries.
7b2b2869c9ee8155b440872e46ced3549123ee1d 2016-07-26 * enc/windows_1257.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for Windows-1257, by Sho Koike.
55378a9eb60b2ae4bfb9051f28f5264d8043008d 2016-07-27 * enc/windows_1253.c: Remove dead code found by Coverity Scan.
4abdd6c5aa82b94196611251bb8c6db96ffb93f5 2016-07-30 * enc/iso_8859_2.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for ISO-8859-2, by Yushiro Ishii.
31040a307e5a662301e6c2202c63636550753a6a 2016-07-30 * string.c (String#downcase), NEWS: Mentioned that case mapping for all of ISO-8859-1~16 is now supported. [ci skip]
d25e478e9191d57d2244f8bbce3119c88c812047 2016-09-07 * common.mk: Updated Unicode version to 9.0.0 [Feature #12513] * unicode/9.0.0/casefold.h, name2ctype.h, unicode/data/9.0.0: new directories/files for Unicode version 9.0.0
c0f48f23858dce0c3326d6cb667bd8004cf5e254 2016-09-07 * unicode/8.0.0/casefold.h, name2ctype.h, unicode/data/8.0.0: removing directories/files related to Unicode version 8.0.0
4915ce691d147a460e14531f4031d6934c5e8c5c 2016-09-09 * News: Announcing update to Unicode version 9.0.0 [ci skip]
64b62f40a58408adf7f8fdebdf34ff02a5453d74 2016-10-16 * enc/windows_1254.c, test/ruby/enc/test_case_comprehensive.rb: Implement non-ASCII case conversion for Windows-1254.
2959b5aa1623f26c6cb946963b295c639c027492 2016-10-29 * enc/windows_1254.c: Fix typo. Reported by k-takata at https://github.com/k-takata/Onigmo/commit/ceb59cc. Thanks!
ad619e02c465eebd15fa57dc658f8e042decdebb 2016-11-28 implement String/Symbol#casecmp? including Unicode case folding
ddfa6e2b530c7af963dd03157bd28ade5d789079 2016-11-29 Fix erroneous test of target against target
dacf977a42b7268cc752fe0b3b521d39ab7b05e9 2016-11-29 Fix/improve documentation of String/Symbol#casecmp[?]
87b937bdfd017ed710d1b547ef42b2a0c7e92592 2016-11-30 fix uppercasing for U+A64B, CYRILLIC SMALL LETTER MONOGRAPH UK
56419df9f31d3419bc388843b0ee167ba6ad836e 2016-12-03 change test class name because it is not only about folding
0d2a4cc551090d91badfcc6a63cdbdca53919959 2016-12-03 add tests againts regressions for upcoming codepoint reordering in unfolding table
31fb4e3ec387da3764bea4772858cad55c504a28 2016-12-04 Reorder codepoints in some entries of CaseUnfold_11_Table
8baa73be48a5ed210aad0c7a4a752bad8c1e32e8 2016-12-04 remove special processing for U+03B9/U+03BC/U+A64B
b70d3f67ee03a1cc19242c1442538986dee7542d 2017-01-02 adjust spacing [ci skip]
67c1197835ee0bd697e9085b98e7a869b95014e4 2017-03-17 clarifiy 'codepoint' in documentation of String#each_codepoint
a5330fa9eac68ba8b8ed70ed1713f9524b25105b 2017-03-18 fix accidental reversal of r57997 in r58000
42b8713703606011cff7dd4ed488fe9adbcebb2a 2017-05-02 rework definition of String#unicode_normalize
8001dae8208ef3c3eedc09776c1efd3c74bf0b6a 2017-05-02 rework definition of String#unicode_normalize! and #unicode_normalized?
107f15728973488ce54bf299751d28236f75a8f4 2017-05-02 remove unnecessary conditions in lib/unicode_normalize.rb
5fee67c9bade0337b5c2c88a7fa1c1068b477803 2017-05-03 move definition of String#unicode_normalize to C to make sure it is documented
90ab1ee023ecdc16913e3aa9ca013eb7781068c1 2017-05-04 move definition of String#unicode_normalize! to C to make sure it is documented
140560e4eeff4ffbf9d63183db25234f46943ab6 2017-05-04 move definition of String#unicode_normalized? to C to make sure it is documented
f47033e237cb474c43dea364698d8b327f4a0813 2017-05-04 refactor common parts of unicode normalization functions into unicode_normalize_common
1e793b64e924a5ff34cfdee133a3ec71efee8975 2017-05-04 remove enc/prelude.rb from list of prelude files in common.mk
05e82555e1677d75cb054b1ff1a426e80bd28013 2017-05-04 revert 58559, because it fails on travis (rubyspec problem)
0324e76d56659ef3fa0c5e90195ab2d9059e5e2e 2017-05-06 remove enc/prelude.rb from list of prelude files in common.mk
c7baa31c7683e2f8b95950f112a38dfd33e5a2ad 2017-05-06 delete lib/unicode_normalize.rb, because no longer needed
f86766970f9cd717e96fdb0ff4e8dc2799d2a50b 2017-05-06 delete enc/prelude.rb, because no longer needed
88892c8d65dd0ff1aa1b16ea576ad0a07a2dc80b 2017-05-09 add explanations about status of module UnicodeNormalize
a4301ec2145ac8912241a2b520807d6947d217ce 2017-05-09 replace hand-written argument check by call to rb_scan_args in unicode_normalize_common
0cbe2cfd6516da15fb5414a4fbadbe305237a7ad 2017-05-19 improve examples, fix one improbably should_not value
94ddec6f9c2eb8beef16ce26201b2e1af4da0daf 2017-05-19 add specs for Unicode-wide case conversions introduced in Ruby 2.4
a03690ae73baf31280015318f377338599c60bc6 2017-05-29 Change max byte length of UTF-8 to 4 bytes
e07bff3ce330d8b47814d6cc3cd28a19dd5300df 2017-05-29 revert r58954 temporarily
11954049fa5604703fe2fca7e3a299f27ec4abda 2017-05-30 Change max byte length of UTF-8 to 4 bytes
d0e7329df3ea43cb9e469ad32aafdbf7267ba2d0 2017-06-01 improve English for a warning message
267323cad5ca4888dae11c9e08ccef175f7d72c7 2017-06-06 add test_ill_formed_utf_8_replace to test recommended number of \uFFFD
b3f38b57e72a9904e6c337196d5077638929e9c2 2017-06-28 add two tests
7c077f7c2de19d18ec05ecc31d3209fe47999194 2017-08-06 add some more assertions to test_ill_formed_utf_8_replace
fda037153591dc1ceee4f96cc9f770a414786a12 2017-08-14 fix documentation of REXML::Formatters::Default (id_hack -> ie_hack) [ci skip]
3d46d51c45b608000d1de02fffa80f0901959e67 2017-09-01 replace copyrights by explanatory text in data files for GB2312/GB12345 mappings
04547c7dc00a425692181130e3e771fdde04e8ad 2017-09-06 update Ruby to Unicode 10.0.0
df155f092ce547d69033b83879d9e961faed095d 2017-09-06 remove Unicode 9.0.0-related files
1704e7a6a7f797a675a50d63e5a024652ad7f18a 2017-09-06 add mention of upgrade to Unicode 10.0.0 to NEWS file
dbb6e0a47c875a6e9ce91e5fdbe094b2284108ab 2017-09-27 improve grammar in documentation of Array#bsearch [ci skip]
a6a1d7ab21338f48ca42b4ad4d1c1d1fd6ed3a38 2017-12-12 mention support of emoji-related Unicode character properties in NEWS
90de118a75921a08497b41de2950971073b3c924 2017-12-12 switch from http to https for Unicode data file downloads (patch from MSP-Greg (Greg L), this closes issue #13962)
abb743884990b385b197eb765dcab68f78bddd2d 2018-07-16 tool/downloader.rb:
fb1059312d2a40e6437853ad2e40049d55308860 2018-07-28 update unicode data files directory for normalization tests
9eb6304aa944183fa0e60a30a3c41a23a4ae1917 2018-07-28 always run unicode normalization that do not depend on data file
a7acec675051f8ed49bbc3ab992ac668e5c29fcf 2018-07-28 fix range check for Hangul jamo trailers in Unicode normalization
4fcd9d708fadb1ecef373eb6d5c7d2883b0d1419 2018-09-05 define ubf_list_atfork() as empty on cygwin
7223582866c297fade5c9beba1d400e098923d2e 2018-10-16 add some comments to enc/unicode/case-folding.rb [ci skip]
33b5c610a6f94853be2557594feb861b90508f56 2018-10-16 update to Unicode 11.0.0 (basic step, not complete yet)
a5818630f8f13ee1b52a45f9e3d14caf9c4b55c9 2018-10-16 revert r65091, r65090 because ci fails
2d5b57d63c235ec5bd0814552b20eb00604b5f65 2018-11-23 prepare for Unicode 11.0.0 update
b5a1a3aa7cc3326b0d43afebb10dd3869fe93842 2018-11-23 add GraphemeBreakeTests.txt file to UNICODE_PROPERTY_FILES for download
7599b3f6c66036d235ff850d4e30a7ae10693fe7 2018-11-24 add tests using Unicode test data for grapheme clusters
33afc9e0b9c51ac8b7dae57219316c91892df498 2018-11-24 remove guard against bug #15337, because it is fixed
fc6243a6a6ef4fa1a241169342ad786dd148e3c7 2018-11-25 deal with ONIGENC_CASE_IS_TITLECASE flag on lowercase characters
9a4d1207367234939e9b15eec841250a606807c2 2018-11-25 create function create_property_node to extract recurring functionality
0889346de5cf06e5888a340f9b1f39b82d8849fe 2018-11-25 add several emoji data files to UNICODE_EMOJI_DOWNLOAD for download
0409290ec077770f48f2274324b9986fa49d5da7 2018-11-26 add tests for grapheme clusters using Unicode Emoji test data
51196cf518b1ee5b368cf8a835fc2789a7ca05cb 2018-11-26 improve messages for test failures
2e075759148689af6b771afb863e88ebbde89c2b 2018-11-27 add some comments in function node_extended_grapheme_cluster() [ci skip]
b62e466fb5a0e520143ff421403497e7aa752c8e 2018-11-27 use 'Regional_Indicator' script property instead of fixed constants
aa4493596932154ffd84c0e9f9e2f863105b529f 2018-11-27 use explicit property name when creating nodes for "Grapheme_Cluster_Break=Extend"
69443998cdca2431b7bd4adf22a776011acfda7c 2018-11-27 introduce helper function quantify_node() to wrap function node_new_quantifier
8f9c00d2074f7226ec206ff72dd8ec1e6775743d 2018-11-27 introduce helper function quantify_property_node()
9e2455b7568e6f562c9acb7071532f2cd504d48c 2018-11-27 introduce helper function create_sequence_node()
7086aae378d85dd84f5ca3dc8c374bb4cabe29b0 2018-11-27 make sure all nodes are correctly freed in create_property_node()
81868891205c4cbe8f5e3f16b0642412c5dbcaa0 2018-11-27 declare array for sequence at start of code creating sequence
99d451f5c5d4d62fb4a4b2e728c74448eef16201 2018-11-27 correctly handle return value from create_sequence_node()
06bd42a7221e85709274830d565157b44e21e964 2018-11-27 introduce two more uses of create_sequence_node in node_extended_grapheme_cluster
7c4a422d835409e76b53a03d77aa2beeec44abb8 2018-11-27 fix order of subexpressions for Hangul
42cb4feda167bcab52d6d1d95f45c48b3eb62723 2018-11-28 reduce number of arguments on quantify_property_node()
97a8334cd301fa1cdde12fb44a54da573e2a9aef 2018-11-28 introduce macro R_ERR to reduce repetitive code
b4e39021f20afeb03a03a02d1ffd47adaed3cd6b 2018-11-28 use create_sequence_node() once more
33c7fa6501e56fb701c87d08d626521e6980205c 2018-11-28 use create_sequence_node() four more times
d3b7a10dcc0f311ac24f24d2599bb243dff88d73 2018-11-28 eliminate intermediate variable in very short block (3 times)
c80aeb527e855950823f252ff382ea24a03a0c2d 2018-11-28 remove two unnecessary variables (np2 and np3)
1fa7087f10eda0c8f47b3874170e4b30bef9e463 2018-12-01 eliminate a list with only one element
3feeed6e991e33ac058b05a46d658aeb89e29725 2018-12-01 introduce and use create_alternate_node()
d1f8694171eed1544a8cc7c29ede155679efa0ca 2018-12-01 remove unnecessary code removing CR/LF from range
d359ee3f553c05949cacdd15efea07ea4a936435 2018-12-02 reformat code [ci skip]
096d362939816b569b22ed3de87c06277ef4c3fb 2018-12-02 add/change some comments in node_extended_grapheme_cluster() [ci skip]
1752d13827392df756389f65dfc82bc2a0416fc7 2018-12-02 expand a small comment [ci skip]
a96a594f9978b28d2d374f4a0fc15f5a2224df9b 2018-12-02 solve the genie/zombie/wrestlers bug
f43a2a5a49fc1995e70e60d98151588f106b2530 2018-12-02 make sure all nodes are freed on error in node_extended_grapheme_cluster()
4a8531db5df048bc1f46154b861d751807a08a1d 2018-12-02 fix order of declarations and code at start of node_extended_grapheme_cluster()
b56e266d6450e21575edf3af355c9723faa0ba97 2018-12-02 remove unnecessary settings with NULL_NODE in \X implementation
db9f1efc4cb93401ce83356903f83caaf26879c6 2018-12-04 change embedding character in TestEmojiBreaks#test_embedded_emoji
84d679794f56de3cc286aa988844a15df2c0cdf5 2018-12-04 exclude skin tones as second component in TestEmojiBreaks#test_mixed_emoji
9b161c3cda4f692e8c31ee4fb722ee54a1f25a2c 2018-12-04 change diaeresis from above to below for better visibility
3a88b4d000db79d9562dd91dc9c9af64cddc3bcc 2018-12-04 commit miss
66a6073859ac6ae2143a9d72162efedece7e1348 2018-12-05 update to Unicode 11.0.0 (main step, not complete yet)
39130926240f87f9bc0123df0db180b71ada4cc4 2018-12-05 adjust some comments in node_extended_grapheme_cluster() [ci skip]
433410a00f8232f90744f1fc7d221c46e4601272 2018-12-05 add update to Unicode 11.0.0 to NEWS [ci skip]
6e97c12e4988a2b4fcbab08c4847cbfd124b0c68 2018-12-05 tweak/remove comments [ci skip]
6cf3ada55c6f92774d49c71664cfa2e59afda201 2018-12-05 remove unused variables in node_extended_grapheme_cluster()
e824e21beb4135f76a6e0f1e51ad578b53d53847 2018-12-06 remove obsolete data from unicode.c
456a696f72f6db1005cff419b24f20bad89078a0 2018-12-06 remove code duplication and streamline identifiers
c114e275c9f5a7d90e8536c6f353abdb15d0f1ab 2018-12-06 make sure all nodes are freed on error in node_extended_grapheme_cluster()
9d170240957813fc5a4758081676d80cc97c3785 2018-12-06 remove an unused variable
7780553c3082124591343bb9fb54820cbcaf7931 2018-12-07 remove code duplication and put everything into forward order
9bb2da28d56e472b7ec156bf1779cf3d974737f7 2018-12-07 convert check for array length to assertion and comment out
a3798256c798255d30c365f689dc12c1eeeb40c3 2018-12-07 replace hardcoded emoji version by RbConfig::CONFIG['UNICODE_EMOJI_VERSION']
c2d8078e3d000ebe074bffcad5b0fae8bd342f52 2018-12-09 delete Unicode 10.0.0 related files, no longer needed [#14802] This line, and those below, will be ignored--
3628eae2e754a7489feebc6f41371d42d2efcf3c 2018-12-09 implement special behavior for Georgian for String#capitalize
dff1e89bfb77e5d57bc56312364ac2036f3f7a99 2018-12-10 add a test to make sure some unassigned codepoints do not get converted
448e86d796b67d4535c2443f379031ace3388f60 2018-12-11 add clarification about details of update to Unicode 11.0.0 [ci skip]
57a67eb8a476bb0a73385a09c1c35b2d55be524e 2018-12-12 Enhance Proc docs [Misc #14610]
5a5dafb78ebd1663acda5d361092d664580ed7bb 2018-12-14 tweak syntax of Proc documentation [ci skip]
eb2e931f6f44c7f3c0701ba38ba8bdd9e3a3b40b 2018-12-19 add option to deal with Unicode beta data files
13c15beb68916b41f69e8d1e96f61478368ec9ac 2018-12-19 avoid passing unnecessary options to download method
89e3d2f8d3db4ae43ee1f3a922136f30595f8467 2019-01-05 separate downloading of Unicode property files and auxiliary files
b991ccc2103eb468cc663a1263750fb86d0745ae 2019-01-06 add logic to handle Unicode beta period file names
d9699fadf8b7031fb92854106d7703b8370eb95f 2019-01-06 simplify filename-related code
e2d1bb1fa4850c8212a7cdbd76a92634a1115453 2019-02-26 add exceptions for indenting conventions for files related to regular expressions
cff7eefa07f8e7bccb5c0b8c856e5657d6d45ffa 2019-03-06 update Unicode version (and Emoji version) to 12.0.0
b3f641085e347d55c1b9b72e25a19c28cdeb84f6 2019-03-06 add news about Unicode version update (issue #15321) to NEWS [ci skip]
f831ca67642101e29bd2de91998d80560d2b7fb6 2019-03-06 delete directory and files related to Unicode version 11.0.0
c604219e8d79f4a224e446a4aa19711155070ef8 2019-04-04 change lib/unicode_normalize/tables.rb to single item per line to make diffs shorter
7fe64d17d3cd455a3f014d6f756cb201320f7f9a 2019-04-05 update to Unicode Version 12.1.0 (beta)
7fd8cdf398456b2da0f6fe541b77826051e60624 2019-04-05 add puts statements to debug Unicode file download (temporary)
8f734dc7b543e7662b66cffd5d698d372d9a0b7a 2019-04-05 output more debug information in downloader.rb (temporary)
cab005db92da4c867a464d2faf27ece30f61b26b 2019-04-05 revert r67445, (r67446,) r67447
50eea44a27cc0b9b2c5c24b725a1119ec0c00e5c 2019-04-05 remove Unicode 12.0.0 related directory and generated files
a2429de63ce82c48a2c08ec4b0f9e89f700345cf 2019-04-09 mention support for Unicode Version 12.1.0 (issue #15195) in NEWS
e713c2bde8117f3aaf39d827d3fc504cd5866976 2019-05-12 switch UNICODE_BETA to NO
c06f9e1d24e0e164506c6b5946a02814319485cc 2019-05-24 switch UNICODE_BETA to NO (one more try, first try didn't work)
f258137083051a7fc2412c62e3fb239f93d1fdf8 2019-06-05 Fix grammar of macro name: ECCESSED -> ECCESSIVE
7f79a86d8b4d250ea0e82eb06cd3336edb840a01 2019-06-13 add comments to mention sort.reverse!
369ff79394765ce198ac7cee872a8c739d895aaa 2019-07-14 add encoding conversion from/to CESU-8
c54635c08b89612bbf2faa50b86fd6d5c8758506 2019-10-29 Update Unicode Emoji version from 12.0 to 12.1.
bc7fbb6f02f47f2e1ac2bcb3175324bb96c08247 2019-10-31 Mention update to Unicode Emoji version 12.1 in NEWS.
f6539202c52a051a4e6946a318a1d9cd29002990 2021-07-01 - add regression tests for U+6E7F (湿) in ISO-2022-JP
b6113a3fd8471a9bd41a866c602746e3eb391de0 2021-07-04 Update common.mk to deal with Unicode version 13.0.0
94fc4b18695d3ffa7a379b0ee7150eef237ac1cf 2021-07-04 Adjust tool/enc-unicode.rb to deal with new location of some emoji files
99cd0e1f79186f7beb15d8a1f3c9f030d5d07de0 2021-07-04 Update lib/unicode_normalize/tables.rb to Unicode version 13.0.0
ba357bace26a118e2ef10ca25cf3741bfb971bb6 2021-07-04 Add new Unicode/Emoji version checks to spec/ruby/library/rbconfig
323ff38c04d15c3efe5eed497620d9bf242817e6 2021-07-04 Add directory and include files for Unicode version 13.0.0
146a943d9aabb9dda3c5fa1fc3d493c1df969184 2021-07-04 Fix ruby_version guards in Unicode/Emoji version specs
3b36e34b9081fc0e43b2fde484f6d07f45ddb2b1 2021-07-08 Adapt test_emoji_breaks.rb to Unicode 13.0.0/Emoji 13.0
6072239121360293dbd2ed607f16b6a11668999a 2021-07-09 Remove no longer needed include files (Unicode Version 12.1.0)
01e98d87853bd612e6ac03a517cb60b077f5339c 2021-07-27 Change Unicode Emoji Version from 13.0 to 13.1
3010b4047727a17fe5b09df15ee3ea1b7efc2cc1 2021-07-27 Adjust test/ruby/enc/test_emoji_breaks.rb to handle Emoji Version 13.1
26b1e6fca8f2c9e02e7b5d26fb35a16f3e7a66a3 2021-07-27 Deal with Unicode ranges in the file emoji-sequences.txt
fd7f61cf2133748075f738e5fb171b62190221bc 2021-08-17 Take into account data in emoji-variation-sequences.txt in tests.
21fd83a823145042767cc5f237b41cc6bb51f2e9 2021-08-17 Mention update to Unicode Version 13.0.0 and Emoji Version 13.1
45b8846bec5b8228767100fceb9389023560dfa1 2021-09-01 Fix minor grammar issue in documentation of Regexp#match? [ci skip]
f2ffa8896436050af679db6b5dff7b1fc35ef819 2021-09-01 Show default argument explicitly for Rexexp#match? [ci skip]
45e0711f29f9ce65cd34ad14e3af1546ccc7252e 2021-12-09 update Unicode Version to 14.0.0 and Emoji version to 14.0
8e1f3a96aecb3defc34556d75e3d2a0867416082 2022-03-11 switch UNICODE_BETA back to NO
2672502457523317268ac24704cf85df91e2cae6 2022-03-11 mention Unicode Version 14.0.0
9b545b0caf2ccc89718ba02ff631d2a68b96a831 2022-03-11 update specs to check for Unicode Version 14.0.0/Emoji Version 14.0
8f59482f5d1f77542fe3c8e8434cb1dbbc764ecf 2022-03-12 add some tests for Unicode Version 14.0.0
45187a0fcddecc74dacc1881f2405a5ebe198081 2022-03-13 comment out failing Unicode/Emoji version checks temporarily
267f0089d3255c1f06ab5adf9f6c77b1ccfd2771 2022-03-13 clarify meaning of version guards for Unicode version specs [ci skip]
56d9d78f14b73cb9f609558e6b760dde50872fb6 2022-03-15 Remove Unicode 13.0.0 related files
48f1e8c5d85043e6adb8e93c94532daa201d42e9 2022-03-15 Fix version check to use Emoji version for emoji-variation-sequences.txt
6d603cfde1cadcc44b8b331459832f2675594702 2022-03-16 add Unicode auxiliary files and Unicode UCD emoji files download for ALWAYS_UPDATE_UNICODE
e85547ae81e126a89a7bc9436af4a26c24c6f754 2022-03-16 Revert "Allow `.0` version mismatch to pass the tests"
151208a19b09e29d45a79d92d73b67ec2ff692a6 2022-03-16 Revert "Fix version check to use Emoji version for emoji-variation-sequences.txt"
e31d645da58abce67a0fb8b27251ee532adcfab5 2022-12-05 fix wrong property information in name2ctype.h
f82a38be2616b190377877646fd91e3314626085 2022-12-06 change Unicode version to 15.0.0
ea532bd557e7579f82f1375cf9bcad5ade5fd6f8 2022-12-06 add file version check for new Unicode emoji file header
156069d7b6a44656e1155ba8f39dbda3be9d6724 2022-12-06 update Unicode normalization tables to version 15.0.0
66bf15636f26e84c644d38d4ce6a61a21d059706 2022-12-06 move Unicode include files to version 15.0.0 directory
368b844874afe10d4a402baa15f1e3fd6cb14e16 2022-12-06 update Unicode include files to version 15.0.0
e0487ec45e12044874473d6c4a05cc25a00d8afb 2022-12-06 update rbconfig spec to Unicode version 15.0.0
68bb6bfc35e8971bde21dd2ed4fc5406d5dc5e7c 2022-12-06 update NEWS to mention Unicode version 15.0.0
11f28f3268f9234b6310bc659de7e46dfc5c0706 2023-01-16 [DOC] Explain that transcoding to the same encoding just copies.
30326844c7e0957bf85ac1befd3755cff2848e9f 2023-05-13 fix wording of spec description [ci skip]
27e61e2df749c82ea3e9d9e6d1004ed718fefeee 2025-01-03 Speed up initial test creation in test_case_comprehensive.rb: - Change generate_case_mapping_tests to generate_single_byte_case_mapping_tests (all encodings using this method happen to be single-byte) - Change precalculation of codepoints by converting from specific encoding, not from Unicode. This reduces creation time. (See https://github.com/ruby/ruby/pull/7425 for a related proposal and some additional comments.)
900ece77b2d338300ea79d19e7c623043a0be810 2025-04-18 prepare Unicode normalization for Unicode 16.0.0