Littera Deusto

Modern Languages, Basque Studies and Humanities

Automatic Summarziation (Questionnaire 2)

mayo 15th, 2009 · No hay Comentarios

This technique helps us summarize the content of a text automatically. It is based on statistical, linguistical and heuristic methods where a summarization key controls how many times a considered “key word” appears. These key words belong to something which the experts call open class words.

This summarization system controls how many times the key words appear, all the verb tenses that appear and the exact location of each sentence in the text. It takes into account all the bold parts and similar, if there are any in the text. All the information that the system collects is finally summarized after passing through the whole proccess.

Contrary t what us humans do, which is summarizing a text by reading it firstm understanding it and then using our knowledge summarizing it, computers, which do not contain that capacity that we do analyze the texts statistically and linguistically, then see wheer some words are important or not and then taking that as a base, they do summarize it.

Human generates a summary of a text by understanding it by the deep semantic processings using huge domain/common knowledge. It is too difficult for the current computer to simulate this human’s processes.Therefore, most automatic summarization programs analyze a text statistically and linguistically, determine important sentences, and generate a summary text from these important sentences.

Automatic summarization ivolves reducing a text document or a larger corpus of multiple documents into a short set of words or paragraph that conveys the main meaning of the text.

References:

Etiquetas:

  • Etiquetas