Open
Description
Ack should support Unicode properly on perls that can handle it.
- Needs to run on perl 5.14 or later (I think this is the minimum version, verify)
- This isn't entirely true; we can handle encoding/decoding and normalization with 5.8. We can't, however, have Unicode-aware regular expressions, nor can we use the shiny new stuff that is included in 5.14. More details later.
- -g patterns should work properly with Unicode filenames or regexes containing Unicode characters (or stuff from
charnames
)- This applies when the composition/decomposition of the regular expression and the source vary.
- The same rules apply to the file searching patterns.
- We need to make sure we properly encode/decode files (this could be tough)
- How do we determine files' encodings? Do we assume UTF-8? Do we provide an option for use in ackrc?
- The output stream should probably be UTF-8 encoded.
- Additional options for collation level should probably be provided.