Elementary Discourse Units (EDUs)

Discourse segmentation refers to the task of breaking a given text into a sequence of elementary discourse units (EDUs). EDUs are clause-like units that serve as building blocks for discourse parsing in Rhetorical Structure Theory.
For example, there are 3 EDUs in this sentence (color coded): Sheraton and Pan Am said they are assured under the Soviet joint-venture law that they can repatriate profits from their hotel venture.


Jing Li, Aixin Sun, and Shafiq Joty, SegBot: A Generic Neural Text Segmentation Model with Pointer Network , IJCAI 2018.