
Add proper Unicode support #258

Open
@hoelzro

Description


Ack should support Unicode properly on perls that can handle it.

  • Needs to run on perl 5.14 or later (I think this is the minimum version; needs verifying)
    • This isn't entirely true: we can handle encoding/decoding and normalization on 5.8. What we can't do there is have Unicode-aware regular expressions or use the newer Unicode features introduced in 5.14. More details later.
  • -g patterns should work properly with Unicode filenames and with regexes containing Unicode characters (or named characters from charnames, i.e. \N{...})
    • This must hold even when the regular expression and the source differ in normalization form (composed vs. decomposed).
  • The same rules apply to the file-search patterns.
  • We need to make sure we properly encode/decode files (this could be tough)
    • How do we determine files' encodings? Do we assume UTF-8? Do we provide an option for use in ackrc?
  • The output stream should probably be UTF-8 encoded.
  • Additional options for collation level should probably be provided.
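
To make the composition/decomposition point concrete, here is a minimal sketch of the idea. It is written in Python purely for illustration and is not ack code. The helper names `decode_bytes` and `search_normalized` are hypothetical, and the UTF-8-then-Latin-1 fallback is only one possible answer to the open question about file encodings. Both pattern and text are brought to NFC before matching, so a composed "é" in the pattern finds a decomposed "e" plus combining accent in the file.

```python
import re
import unicodedata


def decode_bytes(raw: bytes) -> str:
    """Decode file bytes, assuming UTF-8 and falling back to Latin-1.

    The fallback policy is an assumption made for this sketch.
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Latin-1 maps every byte to a code point, so this cannot fail.
        return raw.decode("latin-1")


def search_normalized(pattern: str, text: str):
    """Search after normalizing both the pattern and the text to NFC."""
    nfc_pattern = unicodedata.normalize("NFC", pattern)
    nfc_text = unicodedata.normalize("NFC", text)
    return re.search(nfc_pattern, nfc_text)


composed = "caf\u00e9"      # "é" as a single code point (U+00E9)
decomposed = "cafe\u0301"   # "e" followed by a combining acute (U+0301)

# Simulate reading a line from a UTF-8 file that stores the decomposed form.
line = decode_bytes(decomposed.encode("utf-8"))

naive_hit = re.search(composed, line) is not None
normalized_hit = search_normalized(composed, line) is not None
print(naive_hit, normalized_hit)  # prints: False True
```

Normalizing a raw regex string is a simplification: a combining character that follows a metacharacter or sits inside a character class may need more careful handling than this sketch gives it.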
