New subject: [ruby-core:118014] [Ruby master Misc#20504] Interpolated string literal in regexp encoding handling

23 May 2024

      Issue #20504 has been reported by kddnewton (Kevin Newton).

----------------------------------------
Misc #20504: Interpolated string literal in regexp encoding handling
https://bugs.ruby-lang.org/issues/20504

* Author: kddnewton (Kevin Newton)
* Status: Open
----------------------------------------
There is some very odd behavior that I'm not sure is intentional or not, so I'm looking for guidance. In here:

```ruby
# encoding: us-ascii

interp = "\x80"
regexp = /#{interp}/
```

the `regexp` variable is a ascii-8bit regular expression with the byte interpolated into the middle. However, if you inline that interpolation:

```ruby
# encoding: us-ascii

regexp = /#{"\x80"}/
```

you get a syntax error, saying it's an invalid multi-byte character. I'm not sure what the rule is here, as it seems inconsistent. Is this the correct behavior?

I would prefer if it would create an ascii-8bit regular expression like the first example, which would be consistent.

-- 
https://bugs.ruby-lang.org/

[ruby-core:117990] [Ruby master Misc#20504] Interpolated string literal in regexp encoding handling

kddnewton (Kevin Newton)

Eregon (Benoit Daloze)

kddnewton (Kevin Newton)

mame (Yusuke Endoh)

tags

participants (3)