Transformer Based Multimodal Summarization and Highlight Abstraction Approach for Texts and Speech Audios

TANBERK, SENEM

Transformer Based Multimodal Summarization and Highlight Abstraction Approach for Texts and Speech Audios

Yazarlar (1)
Dr. Öğr. Üyesi Senem TANBERK Kurum Bilgileri Mühendislik ve Mimarlık Fakültesi Yazılım Mühendisliği Bölümü - Ana Bilim Dalı Özgeçmiş Sayfası İletişim Bilgileri: Huawei R&D İstanbul, Türkiye

Devamını Göster

Özet

Multimodal summarization is a kind of summarization application in which its inputs and/or outputs can be in different data types like text, video, and audio. In this study, a new approach based on fine tuning of different pre-trained transformers was developed for abstractive and extractive summarization of audio and text data. In the proposed method, abstractive and extractive summaries of text data are provided only as text, while extractive summaries of audio data are presented as both text and audio data. Abstractive summaries of the audio data are presented as text only. Transformers with text2text input-output relationship were used in both extractive and abstractive summarization processes of the proposed method. For the training and inference processes of audio this type of data to be handled in transformers, an ASR step was followed before the summarization step. The experimental results obtained were …

Anahtar Kelimeler

Bildiri Türü	Tebliğ/Bildiri
Bildiri Alt Türü	Tam Metin Olarak Yayınlanan Tebliğ (Uluslararası Kongre/Sempozyum)
Bildiri Niteliği	Alanında Hakemli Uluslararası Kongre/Sempozyum
Bildiri Dili	İngilizce
Kongre Adı	2024 28th International Conference on Information Technology (IT)
Kongre Tarihi	21-02-2024 / 21-02-2024
Basıldığı Ülke	Türkiye
Basıldığı Şehir

BM Sürdürülebilir Kalkınma Amaçları

Atıf Sayıları
Google Scholar	11

Akademisyenler > Senem TANBERK > Yayın Detayı

Transformer Based Multimodal Summarization and Highlight Abstraction Approach for Texts and Speech Audios

Paylaş