Skip to content

Email address contains more than three special chars(punctuation) is removed by Docsplit.clean_text method #144

@mraj-rpx

Description

@mraj-rpx

I have a email in the pdf like mohan-ramanujam@gmail.com or mohan.raman.visal@gmail.com, the corresponding line number the text_cleaner.rb file is
81 (w[1...-1].scan(PUNCT).uniq.length >= 3) ||
@knowtheory, @jashkenas , @samuelclay : Please provide your opinion on this.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions