New Features

  1. Improved algorithm results.
  2. Optimal Exact Match is now learned (as the first regex).
  3. Expansion Factor is displayed for each regex: Expansion Factor = (number of strings matched) / (number of original input strings).
  4. Abstraction Type is now displayed.

To demonstrate this, here are two actual email results received:

Example 1: Simple example with 2 input strings and a choice of 2 learned regexes (1 Match and 1 Exact Match):


MLREGEX results 2024-11-09 14:08:55 UTC

Your Learn Event Name/Description:

Coffee and Tea

Your set of input strings (2)

coffee

tea

Learned Regexes (2)

1. OPTIMAL EXACT MATCH:
EXPANSION FACTOR: 1.0X

cof{2}e{2}|tea


2. MATCH: ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 4.0X

(cof{2}|t)e{1,2}a?




Example 2: URL example with 13 input strings and a choice of 8 learned regexes (5 Matches and 3 Exact Matches):


MLREGEX results 2024-11-09 14:11:34 UTC

Your Learn Event Name/Description:

URLs

Your set of input strings (13)

http://1.alpha.com

http://2.alpha.com

http://3.alpha.com

http://4.beta.com

http://5.beta.com

http://6.beta.org

http://7.beta.org

https://1.alpha.com

https://2.alpha.com

https://3.alpha.com

https://4.beta.com

https://5.beta.com

https://6.alpha.org

Learned Regexes (8)

1. OPTIMAL EXACT MATCH:
EXPANSION FACTOR: 1.0X

ht{2}p(s?:/{2}((1|2|3)\.alph|(4|5)\.bet)a\.com|(s:/{2}6\.alph|:/{2}(6|7)\.bet)a\.org)


2. MATCH: ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 8.6X

ht{2}ps?:/{2}(1|2|3|4|5|6|7)\.(alph|bet)a\.c?o(m|rg)


3. MATCH: ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 1.2X

(ht{2}ps?:/{2}(1|2|3)\.alpha\.com|ht{2}ps?:/{2}(4\.beta\.com|5\.beta\.com|6\.alpha\.org|(6|7)\.beta\.org))


4. EXACT MATCH: ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 1.0X

((ht{2}ps:/{2}(1|2|3)|ht{2}p:/{2}(1|2|3))\.alpha\.com|(ht{2}ps|ht{2}p):/{2}4\.beta\.com|(ht{2}ps|ht{2}p):/{2}5\.beta\.com|ht{2}p:/{2}(6|7)\.beta\.org|ht{2}ps:/{2}6\.alpha\.org)


5. MATCH: ABSTRACTION TYPE: Digit Class 1
EXPANSION FACTOR: 3.9X

ht{2}p(s?:/{2}(\d\.alph|\d\.bet)a\.com|(s:/{2}6\.alph|:/{2}\d\.bet)a\.org)


6. MATCH: ABSTRACTION TYPE: Word Class 1
EXPANSION FACTOR: 24.3X

ht{2}p(s?:/{2}(\w\.alph|\w\.bet)a\.com|(s:/{2}6\.alph|:/{2}\w\.bet)a\.org)


7. EXACT MATCH: ABSTRACTION TYPE: Digit Ranges 1
EXPANSION FACTOR: 1.0X

ht{2}p(s?:/{2}([1-3]\.alph|[4-5]\.bet)a\.com|(s:/{2}6\.alph|:/{2}[6-7]\.bet)a\.org)


8. MATCH: ABSTRACTION TYPE: Character Ranges 2
EXPANSION FACTOR: 76923.1X

ht{2}p[.-/1-7:a-ceg-hl-mo-pr-t]{13,15}