Issue #19455 has been updated by nagachika (Tomoyuki Chikanaga).
Backport changed from 3.0: DONTNEED, 3.1: DONTNEED, 3.2: REQUIRED to 3.0: DONTNEED, 3.1:
DONTNEED, 3.2: DONE
ruby_3_2 be09d77b966c7bcc77957927f16cefe66b365495 merged revision(s)
a8ba1ddd78544b4bda749051d44f7b2a8a0ec5ff.
----------------------------------------
Bug #19455: Ruby 3.2: wrong Regexp encoding with non-ASCII comments
https://bugs.ruby-lang.org/issues/19455#change-103901
* Author: janosch-x (Janosch Müller)
* Status: Closed
* Priority: Normal
* ruby -v: ruby 3.2.0 (2022-12-25 revision a528908271)
* Backport: 3.0: DONTNEED, 3.1: DONTNEED, 3.2: DONE
----------------------------------------
Non-ASCII comments and comment groups no longer produce the correct `Regexp#encoding`
on Ruby 3.2:
```ruby
# ruby 3.1
/#a/x.encoding # => #<Encoding:US-ASCII> # OK
/(?#a)/.encoding # => #<Encoding:US-ASCII> # OK
/#ü/x.encoding # => #<Encoding:UTF-8> # OK
/(?#ü)/.encoding # => #<Encoding:UTF-8> # OK
# ruby 3.2
/#a/x.encoding # => #<Encoding:US-ASCII> # OK
/(?#a)/.encoding # => #<Encoding:US-ASCII> # OK
/#ü/x.encoding # => #<Encoding:US-ASCII> # BUG
/(?#ü)/.encoding # => #<Encoding:US-ASCII> # BUG
/#ü/x.inspect # => "/#\\xC3\\xBC/x"
/(?#ü)/.inspect # => "/(?#\\xC3\\xBC)/"
# bug is hidden if there are non-ascii chars outside comments
/ü#ü/x.encoding # => #<Encoding:UTF-8>
/ü(?#ü)/.encoding # => #<Encoding:UTF-8>
```
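For checking a given Ruby build, the cases above can be collected into a small script. This is only a sketch: `regexp_encoding_report` is a hypothetical helper name, not anything in Ruby itself. It builds the patterns with `Regexp.new` and `"\u00FC"` (ü) instead of literals so the file stays ASCII-only, on the assumption that `Regexp.new` determines the encoding the same way as the literals in the report.

```ruby
# Hypothetical helper, not part of Ruby or the original report.
# Returns a Hash mapping a case label to the resulting Regexp#encoding.
# "\u00FC" is the character ü, written as an escape to keep this file ASCII-only.
def regexp_encoding_report
  patterns = {
    "ascii line comment"        => ["#a", Regexp::EXTENDED],
    "ascii comment group"       => ["(?#a)", 0],
    "non-ascii line comment"    => ["#\u00FC", Regexp::EXTENDED],
    "non-ascii comment group"   => ["(?#\u00FC)", 0],
    "non-ascii outside comment" => ["\u00FC#\u00FC", Regexp::EXTENDED],
  }
  # Each value is [source, options]; compile it and keep only the encoding.
  patterns.transform_values do |source, options|
    Regexp.new(source, options).encoding
  end
end

puts "ruby #{RUBY_VERSION}"
regexp_encoding_report.each do |label, encoding|
  puts "#{label}: #{encoding}"
end
```

Going by the results listed above, the two "non-ascii ... comment" rows are the ones that would show US-ASCII on an affected 3.2 build and UTF-8 on 3.1; the other rows should be the same on both.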
I think these changes might be the cause:
https://github.com/ruby/ruby/commit/ec3542229b29ec93062e9d90e877ea29d3c1947…
@jeremyevans0 JFYI
--
https://bugs.ruby-lang.org/