New Features
- Improved algorithm results.
- An Optimal Exact Match is now learned and listed as the first regex.
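- Expansion Factor is now displayed for each regex: Expansion Factor = (number of strings the regex matches) / (number of original input strings). A regex that matches only the input strings, such as an Exact Match, has an Expansion Factor of 1.0X.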
- Abstraction Type is now displayed.
To demonstrate these features, here are two actual result emails:
Example 1: Simple example with 2 input strings and a choice of 2 learned regexes (1 Match and 1 Exact Match):
MLREGEX results 2024-11-09 14:08:55 UTC
Your Learn Event Name/Description:
Coffee and Tea
Your set of input strings (2)
coffee
tea
Learned Regexes (2)
1. OPTIMAL EXACT MATCH:
EXPANSION FACTOR: 1.0X
cof{2}e{2}|tea
2. MATCH: ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 4.0X
(cof{2}|t)e{1,2}a?
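The Expansion Factors reported in Example 1 can be checked by hand with a brute-force count. The sketch below is only an illustration and is not how MLREGEX itself computes the value: the helper names `matched_strings` and `expansion_factor` are hypothetical, and the search is limited (by assumption) to the six letters that appear in the two regexes and to strings of at most 7 characters, which is the longest string either regex can match.

```python
import itertools
import re

def matched_strings(pattern, alphabet, max_len):
    """Return every string over `alphabet` (length 1..max_len) that `pattern` fully matches."""
    compiled = re.compile(pattern)
    found = []
    for length in range(1, max_len + 1):
        for chars in itertools.product(alphabet, repeat=length):
            candidate = "".join(chars)
            if compiled.fullmatch(candidate):
                found.append(candidate)
    return found

def expansion_factor(pattern, inputs, alphabet, max_len):
    """Expansion Factor = (number of strings matched) / (number of input strings)."""
    return len(matched_strings(pattern, alphabet, max_len)) / len(inputs)

inputs = ["coffee", "tea"]
alphabet = "acefot"   # assumption: only the literals used by the two regexes
max_len = 7           # longest possible match is "coffeea"

exact = r"cof{2}e{2}|tea"
match = r"(cof{2}|t)e{1,2}a?"

print(f"{expansion_factor(exact, inputs, alphabet, max_len):.1f}X")  # 1.0X
print(f"{expansion_factor(match, inputs, alphabet, max_len):.1f}X")  # 4.0X
print(sorted(matched_strings(match, alphabet, max_len)))
# ['coffe', 'coffea', 'coffee', 'coffeea', 'te', 'tea', 'tee', 'teea']
```

The Match regex accepts 8 strings for 2 inputs, which gives the 4.0X shown in the email. Brute force is only practical for small, finite languages like this one.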
Example 2: URL example with 13 input strings and a choice of 8 learned regexes (5 Matches and 3 Exact Matches):
MLREGEX results 2024-11-09 14:11:34 UTC
Your Learn Event Name/Description:
URLs
Your set of input strings (13)
http://1.alpha.com
http://2.alpha.com
http://3.alpha.com
http://4.beta.com
http://5.beta.com
http://6.beta.org
http://7.beta.org
https://1.alpha.com
https://2.alpha.com
https://3.alpha.com
https://4.beta.com
https://5.beta.com
https://6.alpha.org
Learned Regexes (8)
1. OPTIMAL EXACT MATCH:
EXPANSION FACTOR: 1.0X
ht{2}p(s?:/{2}((1|2|3)\.alph|(4|5)\.bet)a\.com|(s:/{2}6\.alph|:/{2}(6|7)\.bet)a\.org)
2. MATCH: ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 8.6X
ht{2}ps?:/{2}(1|2|3|4|5|6|7)\.(alph|bet)a\.c?o(m|rg)
3. MATCH: ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 1.2X
(ht{2}ps?:/{2}(1|2|3)\.alpha\.com|ht{2}ps?:/{2}(4\.beta\.com|5\.beta\.com|6\.alpha\.org|(6|7)\.beta\.org))
4. EXACT MATCH: ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 1.0X
((ht{2}ps:/{2}(1|2|3)|ht{2}p:/{2}(1|2|3))\.alpha\.com|(ht{2}ps|ht{2}p):/{2}4\.beta\.com|(ht{2}ps|ht{2}p):/{2}5\.beta\.com|ht{2}p:/{2}(6|7)\.beta\.org|ht{2}ps:/{2}6\.alpha\.org)
5. MATCH: ABSTRACTION TYPE: Digit Class 1
EXPANSION FACTOR: 3.9X
ht{2}p(s?:/{2}(\d\.alph|\d\.bet)a\.com|(s:/{2}6\.alph|:/{2}\d\.bet)a\.org)
6. MATCH: ABSTRACTION TYPE: Word Class 1
EXPANSION FACTOR: 24.3X
ht{2}p(s?:/{2}(\w\.alph|\w\.bet)a\.com|(s:/{2}6\.alph|:/{2}\w\.bet)a\.org)
7. EXACT MATCH: ABSTRACTION TYPE: Digit Ranges 1
EXPANSION FACTOR: 1.0X
ht{2}p(s?:/{2}([1-3]\.alph|[4-5]\.bet)a\.com|(s:/{2}6\.alph|:/{2}[6-7]\.bet)a\.org)
8. MATCH: ABSTRACTION TYPE: Character Ranges 2
EXPANSION FACTOR: 76923.1X
ht{2}p[.-/1-7:a-ceg-hl-mo-pr-t]{13,15}
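The results in Example 2 can be sanity-checked in a similar way. The following sketch makes two checks: that every learned regex fully matches all 13 input strings, and that the Expansion Factors of regexes 1 to 7 can be reproduced by counting matches. It rests on some assumptions that are not part of MLREGEX: the names `INPUTS`, `LEARNED`, `CANDIDATES`, and `expansion_factor` are hypothetical, `\d` and `\w` are treated as ASCII classes (10 and 63 characters), and matches are counted over a hand-built candidate grid that covers every string regexes 1 to 7 can produce. Regex 8 is only checked against the inputs because its language is far too large to enumerate.

```python
import itertools
import re
import string

INPUTS = [
    "http://1.alpha.com", "http://2.alpha.com", "http://3.alpha.com",
    "http://4.beta.com", "http://5.beta.com",
    "http://6.beta.org", "http://7.beta.org",
    "https://1.alpha.com", "https://2.alpha.com", "https://3.alpha.com",
    "https://4.beta.com", "https://5.beta.com",
    "https://6.alpha.org",
]

# Regexes copied verbatim from the email above, keyed by their list number.
LEARNED = {
    1: r"ht{2}p(s?:/{2}((1|2|3)\.alph|(4|5)\.bet)a\.com|(s:/{2}6\.alph|:/{2}(6|7)\.bet)a\.org)",
    2: r"ht{2}ps?:/{2}(1|2|3|4|5|6|7)\.(alph|bet)a\.c?o(m|rg)",
    3: r"(ht{2}ps?:/{2}(1|2|3)\.alpha\.com|ht{2}ps?:/{2}(4\.beta\.com|5\.beta\.com|6\.alpha\.org|(6|7)\.beta\.org))",
    4: r"((ht{2}ps:/{2}(1|2|3)|ht{2}p:/{2}(1|2|3))\.alpha\.com|(ht{2}ps|ht{2}p):/{2}4\.beta\.com|(ht{2}ps|ht{2}p):/{2}5\.beta\.com|ht{2}p:/{2}(6|7)\.beta\.org|ht{2}ps:/{2}6\.alpha\.org)",
    5: r"ht{2}p(s?:/{2}(\d\.alph|\d\.bet)a\.com|(s:/{2}6\.alph|:/{2}\d\.bet)a\.org)",
    6: r"ht{2}p(s?:/{2}(\w\.alph|\w\.bet)a\.com|(s:/{2}6\.alph|:/{2}\w\.bet)a\.org)",
    7: r"ht{2}p(s?:/{2}([1-3]\.alph|[4-5]\.bet)a\.com|(s:/{2}6\.alph|:/{2}[6-7]\.bet)a\.org)",
    8: r"ht{2}p[.-/1-7:a-ceg-hl-mo-pr-t]{13,15}",
}

# Assumed candidate grid: scheme x one ASCII word character x name x suffix.
# The suffixes "om" and "corg" are included because regex 2 allows c?o(m|rg).
WORD_CHARS = string.ascii_letters + string.digits + "_"
CANDIDATES = [
    f"{scheme}://{ch}.{name}.{suffix}"
    for scheme, ch, name, suffix in itertools.product(
        ("http", "https"), WORD_CHARS, ("alpha", "beta"), ("com", "org", "om", "corg")
    )
]

def expansion_factor(pattern):
    """Count full matches over the candidate grid, divided by the number of inputs."""
    matched = sum(1 for c in CANDIDATES if re.fullmatch(pattern, c, re.ASCII))
    return matched / len(INPUTS)

# Every learned regex should accept every original input string.
for number, pattern in LEARNED.items():
    assert all(re.fullmatch(pattern, s, re.ASCII) for s in INPUTS), number

for number in range(1, 8):
    print(f"{number}. {expansion_factor(LEARNED[number]):.1f}X")
# 1. 1.0X
# 2. 8.6X
# 3. 1.2X
# 4. 1.0X
# 5. 3.9X
# 6. 24.3X
# 7. 1.0X
```

Under these assumptions the counts are 13, 112, 16, 13, 51, 316, and 13 matched strings respectively, which agree with the Expansion Factors reported in the email.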