Marginally relevant to automatic summarisation, but quite a nice introduction by Prof. Stephan P Kudyba from NJIT School of Managemen to what data mining is and how it is being used in the business world. Not as funny as the beer & diapers classic example used in data mining, but more authentic.
It is nice to see that people still create resources for automatic summarisation. A few days ago, The Essex Arabic Summaries Corpus (EASC) was announced on corpora list. The corpus contains 153 Arabic articles and 765 human-generated extractive summaries produced using Mechanical Turk. The authors state that the corpus contains copyrighted material and it is the responsibility of the users to makes sure they comply with the legislation in their country. Unfortunately no further information is given in order to know what material was used. The annotation is freely available for research purposes and is distributed under the Creative Commons Attributive/Share Alike license. As a bonus the corpus comes with an Arabic version of ROUGE.
The corpus cannot be downloaded directly. Instead the author needs to be contacted.
A new tutorial on automatic summarisation from RANLP2009 was added to the site. It can be viewed online using iPaper or downloaded for offline reading.
Welcome the the Automatic Summarization Online web site. The purpose of this web site is to provide pointers and information relevant to the field of automatic summarisation. For the moment there is not that much, but hopefully more information will be added in the coming weeks and months.
This site aims to be a hub for information from the field of automatic summarization, trying to reflect its current state. Currently the site is under heavy development, so come back soon for more information.