Please note, due to scheduled maintenance online transactions will not be possible between 07:00 - 10:00 GMT, on Thursday 21st March (03:00-06:00 EDT, 21 March, 2019). We apologise for any inconvenience.
Regular-expression derivatives are an old, but elegant, technique for compiling regular expressions to deterministic finite-state machines. It easily supports extending the regular-expression operators with boolean operations, such as intersection and complement. Unfortunately, this technique has been lost in the sands of time and few computer scientists are aware of it. In this paper, we reexamine regular-expression derivatives and report on our experiences in the context of two different functional-language implementations. The basic implementation is simple and we show how to extend it to handle large character sets (e.g., Unicode). We also show that the derivatives approach leads to smaller state machines than the traditional algorithm given by McNaughton and Yamada.
Email your librarian or administrator to recommend adding this journal to your organisation's collection.