This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.
2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
=> "\xFC\xA1\xA1\xA1\xA1\xA1abc"
2.7.1 :004 > haystack.scan(/.+/)
=> ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdeskEdit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.
yxhuvud•15h ago