Gigabyte per Second Unicode Regex Search with Icgrep
Hacker News
Let's look for lines containing Greek characters in 2.78 GB Arabic language Wikipedia file. We'll use a tiny Intel NUC box with a low-power Intel Core i3-5010U CPU @ 2.10GHz and a Samsung SM 951 SSD (256GB solid-state drive with PCI express interface). …
25 October 2015