2011 |
A Generate and Rank Approach to Sentence Paraphrasing (Paper in Conference Proceedings) Malakasiotis, Prodromos; Androutsopoulos, Ion Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011. @inproceedings{Malakasiotis2011, title = {A Generate and Rank Approach to Sentence Paraphrasing}, author = {Prodromos Malakasiotis and Ion Androutsopoulos}, url = {http://www.aclweb.org/anthology/D11-1009}, year = {2011}, date = {2011-01-03}, booktitle = {Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing}, abstract = {We present a method that paraphrases a given sentence by first generating candidate paraphrases and then ranking (or classifying) them. The candidates are generated by applying existing paraphrasing rules extracted from parallel corpora. The ranking component considers not only the overall quality of the rules that produced each candidate, but also the extent to which they preserve grammaticality and meaning in the particular context of the input sentence, as well as the degree to which the candidate differs from the input. We experimented with both a Maximum Entropy classifier and an SVR ranker. Experimental results show that incorporating features from an existing paraphrase recognizer in the ranking component improves performance, and that our overall method compares well against a state of the art paraphrase generator, when paraphrasing rules apply to the input sentences. We also propose a new methodology to evaluate the ranking components of generate-and-rank paraphrase generators, which evaluates them across different combinations of weights for grammaticality, meaning preservation, and diversity. The paper is accompanied by a paraphrasing dataset we constructed for evaluations of this kind.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } We present a method that paraphrases a given sentence by first generating candidate paraphrases and then ranking (or classifying) them. The candidates are generated by applying existing paraphrasing rules extracted from parallel corpora. The ranking component considers not only the overall quality of the rules that produced each candidate, but also the extent to which they preserve grammaticality and meaning in the particular context of the input sentence, as well as the degree to which the candidate differs from the input. We experimented with both a Maximum Entropy classifier and an SVR ranker. Experimental results show that incorporating features from an existing paraphrase recognizer in the ranking component improves performance, and that our overall method compares well against a state of the art paraphrase generator, when paraphrasing rules apply to the input sentences. We also propose a new methodology to evaluate the ranking components of generate-and-rank paraphrase generators, which evaluates them across different combinations of weights for grammaticality, meaning preservation, and diversity. The paper is accompanied by a paraphrasing dataset we constructed for evaluations of this kind. |
A New Sentence Compression Dataset and Its Use in an Abstractive Generate-and-Rank Sentence Compressor (Paper in Conference Proceedings) Galanis, Dimitrios; Androutsopoulos, Ion Proceedings of the UCNLG+Eval: Language Generation and Evaluation Workshop, Pages: 1-11, 2011. @inproceedings{Galanis2011, title = {A New Sentence Compression Dataset and Its Use in an Abstractive Generate-and-Rank Sentence Compressor}, author = {Dimitrios Galanis and Ion Androutsopoulos}, url = {http://www.aclweb.org/anthology/W11-2701}, year = {2011}, date = {2011-01-03}, booktitle = {Proceedings of the UCNLG+Eval: Language Generation and Evaluation Workshop}, pages = {1-11}, abstract = {Sentence compression has attracted much interest in recent years, but most sentence compressors are extractive, i.e., they only delete words. There is a lack of appropriate datasets to train and evaluate abstractive sentence compressors, i.e., methods that apart from deleting words can also rephrase expressions. We present a new dataset that contains candidate extractive and abstractive compressions of source sentences. The candidate compressions are annotated with human judgements for grammaticality and meaning preservation. We discuss how the dataset was created, and how it can be used in generate-and-rank abstractive sentence compressors. We also report experimental results with a novel abstractive sentence compressor that uses the dataset. }, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } Sentence compression has attracted much interest in recent years, but most sentence compressors are extractive, i.e., they only delete words. There is a lack of appropriate datasets to train and evaluate abstractive sentence compressors, i.e., methods that apart from deleting words can also rephrase expressions. We present a new dataset that contains candidate extractive and abstractive compressions of source sentences. The candidate compressions are annotated with human judgements for grammaticality and meaning preservation. We discuss how the dataset was created, and how it can be used in generate-and-rank abstractive sentence compressors. We also report experimental results with a novel abstractive sentence compressor that uses the dataset. |
2010 |
MOPSEUS - A Digital Repository System with Semantically Enhanced Preservation Services (Paper in Conference Proceedings) Gavrilis, Dimitris; Angelis, Stavros; Papatheodorou, Christos Proceedings of the 7th International Conference on Preservation of Digital Objects, iPRES2010, Vienna, Austria, September 2010, Pages: 135-143, 2010. @inproceedings{Gavrilis2010, title = {MOPSEUS - A Digital Repository System with Semantically Enhanced Preservation Services}, author = {Dimitris Gavrilis and Stavros Angelis and Christos Papatheodorou}, url = {http://www.ifs.tuwien.ac.at/dp/ipres2010/papers/gavrilis-34.pdf}, year = {2010}, date = {2010-09-01}, booktitle = {Proceedings of the 7th International Conference on Preservation of Digital Objects, iPRES2010, Vienna, Austria, September 2010}, journal = {Proceedings of the 7th International Conference on Preservation of Digital Objects, iPRES2010}, pages = {135-143}, abstract = {Repository platforms offer significant tools aiding institutions to preserve the wealth of their information resources. This paper presents the data model as well as the architectural features of Mopseus, a digital library service, built on top of Fedora-commons middleware, designed to facilitate institutions to develop and preserve their own repositories. The main advantage of Mopseus is that it minimizes the customization and programming effort that Fedora-commons involves. Moreover it provides an added value service which semantically annotates the internal structure of a Digital Object. The paper focuses on the preservation functionalities of Mopseus and presents a mechanism for automated generation of PREMIS metadata for each Digital Object of the repository. This mechanism is activated whenever an object is modified and is based on a mapping of the Mopseus data model to the PREMIS data model that ensures the validity of the transformation of the information stored in a Mopseus repository to semantically equivalent PREMIS metadata. }, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } Repository platforms offer significant tools aiding institutions to preserve the wealth of their information resources. This paper presents the data model as well as the architectural features of Mopseus, a digital library service, built on top of Fedora-commons middleware, designed to facilitate institutions to develop and preserve their own repositories. The main advantage of Mopseus is that it minimizes the customization and programming effort that Fedora-commons involves. Moreover it provides an added value service which semantically annotates the internal structure of a Digital Object. The paper focuses on the preservation functionalities of Mopseus and presents a mechanism for automated generation of PREMIS metadata for each Digital Object of the repository. This mechanism is activated whenever an object is modified and is based on a mapping of the Mopseus data model to the PREMIS data model that ensures the validity of the transformation of the information stored in a Mopseus repository to semantically equivalent PREMIS metadata. |
Query Transformation in a CIDOC CRM Based Cultural Metadata Integration Environment (Paper in Conference Proceedings) Gergatsoulis, Manolis; Bountouri, Lina; Gaitanou, Panoraia; Papatheodorou, Christos Proceedings of the 14th European Conference Research and Advanced Technology for Digital Libraries, ECDL 2010, Pages: 38-45, 2010. @inproceedings{Gergatsoulis2010, title = {Query Transformation in a CIDOC CRM Based Cultural Metadata Integration Environment}, author = {Manolis Gergatsoulis and Lina Bountouri and Panoraia Gaitanou and Christos Papatheodorou}, url = {http://www.springerlink.com/content/m5m353t715866632/}, year = {2010}, date = {2010-09-01}, booktitle = {Proceedings of the 14th European Conference Research and Advanced Technology for Digital Libraries, ECDL 2010}, pages = {38-45}, abstract = {The wide use of a number of cultural heritage metadata schemas imposes the development of new interoperability techniques that facilitate unified access to cultural resources. In this paper, we focus on the ontology based semantic integration by proposing an expressive mapping language for the specification of the mappings between the XML-based metadata schemas and the CIDOC CRM ontology. We also present an algorithm for the transformation of XPath queries posed on XML-based metadata into equivalent queries on the CIDOC CRM ontology. }, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } The wide use of a number of cultural heritage metadata schemas imposes the development of new interoperability techniques that facilitate unified access to cultural resources. In this paper, we focus on the ontology based semantic integration by proposing an expressive mapping language for the specification of the mappings between the XML-based metadata schemas and the CIDOC CRM ontology. We also present an algorithm for the transformation of XPath queries posed on XML-based metadata into equivalent queries on the CIDOC CRM ontology. |
Mopseus - A Digital Library Management System Focused on Preservation (Paper in Conference Proceedings) Gavrilis, Dimitris; Papatheodorou, Christos; Constantopoulos, Panos; Angelis, Stavros Proceedings of the 14th European Conference Research and Advanced Technology for Digital Libraries, ECDL 2010, Volume: 6273 Pages: 445-448, 2010. @inproceedings{Gavrilis2010b, title = {Mopseus - A Digital Library Management System Focused on Preservation}, author = {Dimitris Gavrilis and Christos Papatheodorou and Panos Constantopoulos and Stavros Angelis}, url = {http://www.springerlink.com/content/j5025k0058664015/}, year = {2010}, date = {2010-09-01}, booktitle = {Proceedings of the 14th European Conference Research and Advanced Technology for Digital Libraries, ECDL 2010}, volume = {6273}, pages = {445-448}, abstract = {This paper presents Mopseus, a Fedora-commons based digital repository that focuses on preservation. An overview of the general architecture of the system is presented along with some more in-depth details of its semantic structures. Mopseus features dynamic RDF- based relations, a service for defining metadata schemas, a built-in RDBMS synchronization and indexing mechanism, a mechanism for migration from existing repositories and a built-in workflow engine. }, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } This paper presents Mopseus, a Fedora-commons based digital repository that focuses on preservation. An overview of the general architecture of the system is presented along with some more in-depth details of its semantic structures. Mopseus features dynamic RDF- based relations, a service for defining metadata schemas, a built-in RDBMS synchronization and indexing mechanism, a mechanism for migration from existing repositories and a built-in workflow engine. |
Modelling the Public Sector Information through CIDOC Conceptual Reference Model (Paper in Conference Proceedings) Bountouri, Lina; Papatheodorou, Christos; Gergatsoulis, Manolis Proceedings of the 6th International Workshop on Ontology Content, OnToContent 2010, Volume: 6428 Pages: 404-413, 2010. @inproceedings{Bountouri2010, title = {Modelling the Public Sector Information through CIDOC Conceptual Reference Model}, author = {Lina Bountouri and Christos Papatheodorou and Manolis Gergatsoulis }, url = {http://dl.acm.org/citation.cfm?id=1948597&preflayout=flat}, year = {2010}, date = {2010-01-01}, booktitle = {Proceedings of the 6th International Workshop on Ontology Content, OnToContent 2010}, journal = {Proceedings of the 6th International Workshop on Ontology Content, OnToContent 2010}, volume = {6428}, pages = {404-413}, abstract = {Nowadays, due to the growing development of eGovernment information systems, there is an increasing need to handle Public Sector Information (PSI) in a homogeneous way. Ontologies are currently a powerful tool to act as semantic reference models for the development of information systems and as semantic mediators for achieving interoperability. In this paper, we analyze the procedures that lead to the PSI's production and management and we present all the concepts and agents that relate to it. Based on this analysis and given that CIDOC CRM ontology is able to define the rich semantics of the historical records' production and management, we propose the CIDOC CRM to represent the public records' conceptualization and to act as a reference model for PSI. }, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } Nowadays, due to the growing development of eGovernment information systems, there is an increasing need to handle Public Sector Information (PSI) in a homogeneous way. Ontologies are currently a powerful tool to act as semantic reference models for the development of information systems and as semantic mediators for achieving interoperability. In this paper, we analyze the procedures that lead to the PSI's production and management and we present all the concepts and agents that relate to it. Based on this analysis and given that CIDOC CRM ontology is able to define the rich semantics of the historical records' production and management, we propose the CIDOC CRM to represent the public records' conceptualization and to act as a reference model for PSI. |
2009 |
Finding Short Definitions of Terms on Web Pages (Paper in Conference Proceedings) Lampouras, Gerasimos; Androutsopoulos, Ion Proceedings of the 2009 Conference on Empirical Methods on Natural Language Processing (EMNLP 2009 at ACL/IJCNLP 2009), Suntec, Singapore, 2009, Pages: 1270-1279, 2009, ISBN: 978-1-932432-59-6. @inproceedings{Lampouras2009, title = {Finding Short Definitions of Terms on Web Pages}, author = {Gerasimos Lampouras and Ion Androutsopoulos}, url = {http://nlp.cs.aueb.gr/pubs/emnlp09_paper.pdf}, isbn = {978-1-932432-59-6}, year = {2009}, date = {2009-09-01}, booktitle = {Proceedings of the 2009 Conference on Empirical Methods on Natural Language Processing (EMNLP 2009 at ACL/IJCNLP 2009), Suntec, Singapore, 2009}, pages = {1270-1279}, abstract = {We present a system that finds short definitions of terms on Web pages. It employs a Maximum Entropy classifier, but it is trained on automatically generated examples; hence, it is in effect unsupervised. We use ROUGE-W to generate training examples from encyclopedias and Web snippets, a method that outperforms an alternative centroid-based one. After training, our system can be used to find definitions of terms that are not covered by encyclopedias. The system outperforms a comparable publicly available system, as well as a previously published form of our system.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } We present a system that finds short definitions of terms on Web pages. It employs a Maximum Entropy classifier, but it is trained on automatically generated examples; hence, it is in effect unsupervised. We use ROUGE-W to generate training examples from encyclopedias and Web snippets, a method that outperforms an alternative centroid-based one. After training, our system can be used to find definitions of terms that are not covered by encyclopedias. The system outperforms a comparable publicly available system, as well as a previously published form of our system. |
A Digital Library Service for the Small (Paper in Conference Proceedings) Angelis, Stavros; Constantopoulos, Panos; Gavrilis, Dimitris; Papatheodorou, Christos Proceedings of the 2nd Digital Curation Curriculum Symposium: Digital Curation Practice, Promise and Prospects 2009, 2009. @inproceedings{Angelis2009, title = {A Digital Library Service for the Small}, author = {Stavros Angelis and Panos Constantopoulos and Dimitris Gavrilis and Christos Papatheodorou}, url = {http://users.ionio.gr/~papatheodor/papers/digccurr09.pdf}, year = {2009}, date = {2009-04-01}, booktitle = {Proceedings of the 2nd Digital Curation Curriculum Symposium: Digital Curation Practice, Promise and Prospects 2009}, journal = {DigCCurr 2009: Digital Curation Practice, Promise and Prospects}, abstract = {In this paper, we present MOPSEUS, a lightweight digital library service based on the Fedora system. This service was created to address the needs of small libraries without support from technical staff. Hence, MOPSEUS attempts to balance flexibility against ease of installation, configuration and use. The service is available as a standard Java Web servlet, uses no external databases or other systems and can easily be deployed on top of any Fedora installation. Preliminary tests concerning the ease of installation and use are encouraging. We contend that facilitating the introduction of digital library infrastructures in the small may contribute to spreading digital curation practices.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we present MOPSEUS, a lightweight digital library service based on the Fedora system. This service was created to address the needs of small libraries without support from technical staff. Hence, MOPSEUS attempts to balance flexibility against ease of installation, configuration and use. The service is available as a standard Java Web servlet, uses no external databases or other systems and can easily be deployed on top of any Fedora installation. Preliminary tests concerning the ease of installation and use are encouraging. We contend that facilitating the introduction of digital library infrastructures in the small may contribute to spreading digital curation practices. |
2008 |
Building an adaptive museum gallery in Second Life (Paper in Conference Proceedings) Oberlander, Jon; Karakatsiotis, George; Isard, Amy; Androutsopoulos, Ion Trant,; Bearman, (Ed.): Museums and the Web 2008: Proceedings, Archives & Museum Informatics, 2008. @inproceedings{Oberlander2008, title = {Building an adaptive museum gallery in Second Life}, author = {Jon Oberlander and George Karakatsiotis and Amy Isard and Ion Androutsopoulos}, editor = {J. Trant and D. Bearman }, url = {http://www.archimuse.com/mw2008/papers/oberlander/oberlander.html}, year = {2008}, date = {2008-03-31}, booktitle = {Museums and the Web 2008: Proceedings}, publisher = {Archives & Museum Informatics}, abstract = {We describe initial work on building a virtual gallery, within Second Life, which can automatically tailor itself to an individual visitor, responding to their abilities, interests, preferences or history of interaction. The description of an object in the virtual world can be personalised to suit the beginner or the expert, varying how it is said—via the choice of language (such as English or Greek), the words, or the complexity of sentences—as well as what is said—by taking into account what else has been seen or described already. The guide delivering the descriptions can remain disembodied, or be embodied as a robotic avatar.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } We describe initial work on building a virtual gallery, within Second Life, which can automatically tailor itself to an individual visitor, responding to their abilities, interests, preferences or history of interaction. The description of an object in the virtual world can be personalised to suit the beginner or the expert, varying how it is said—via the choice of language (such as English or Greek), the words, or the complexity of sentences—as well as what is said—by taking into account what else has been seen or described already. The guide delivering the descriptions can remain disembodied, or be embodied as a robotic avatar. |
Aspects of a digital curation agenda for cultural heritage (Paper in Conference Proceedings) Constantopoulos, Panos; Dallas, Costis Proceedings of the IEEE International Conference on Distributed Human-Machine Systems, 2008, Pages: 317–322, 2008. @inproceedings{Constantopoulos2008, title = {Aspects of a digital curation agenda for cultural heritage}, author = {Panos Constantopoulos and Costis Dallas }, url = {http://www.dcu.gr/wp-content/uploads/2016/10/Aspects-of-a-digital-curation-agenda-for-cultural-heritage.pdf}, year = {2008}, date = {2008-03-03}, booktitle = {Proceedings of the IEEE International Conference on Distributed Human-Machine Systems, 2008}, pages = {317--322}, abstract = {Digital curation emerged recently as an important concept in the theory and management of cultural heritage information. This paper presents the approach and research agenda adopted by the newly-founded Digital Curation Unit of Athena Research Centre, Greece, and illustrates its relevance to the management and use of cultural heritage digital collections. It highlights the need to tackle the risks of epistemic failure tied with the prospect of long-term access to curated repositories, and presents the case for multidisciplinary research, informed by humanistic and social science as well as computer science perspectives. A multi-tiered research agenda, it argues, would need to resolve problems of representing domain knowledge; developing and maintaining knowledge resources; streamlining the enrichment of these resources from text; automatically generating text from databases; discovering and accessing domain associations; enabling the use of databases containing valuable data over time; conceptualizations of epistemic discourse, and communication genres in specific contexts; grounded research on the motives, activities and contexts of digital resources appraisal, knowledge enhancement and use; and, cost-benefit assessment of preservation policies. These complementary approaches are particularly relevant in the field of cultural heritage, where large-scale digitisation of heritage resources on one hand, and web-based social computing on the other, already create a deluge of un-curated and poorly represented cultural information.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } Digital curation emerged recently as an important concept in the theory and management of cultural heritage information. This paper presents the approach and research agenda adopted by the newly-founded Digital Curation Unit of Athena Research Centre, Greece, and illustrates its relevance to the management and use of cultural heritage digital collections. It highlights the need to tackle the risks of epistemic failure tied with the prospect of long-term access to curated repositories, and presents the case for multidisciplinary research, informed by humanistic and social science as well as computer science perspectives. A multi-tiered research agenda, it argues, would need to resolve problems of representing domain knowledge; developing and maintaining knowledge resources; streamlining the enrichment of these resources from text; automatically generating text from databases; discovering and accessing domain associations; enabling the use of databases containing valuable data over time; conceptualizations of epistemic discourse, and communication genres in specific contexts; grounded research on the motives, activities and contexts of digital resources appraisal, knowledge enhancement and use; and, cost-benefit assessment of preservation policies. These complementary approaches are particularly relevant in the field of cultural heritage, where large-scale digitisation of heritage resources on one hand, and web-based social computing on the other, already create a deluge of un-curated and poorly represented cultural information. |
Designing Interoperable Museum Information Systems (Paper in Conference Proceedings) Gavrilis, Dimitris; Tsakonas, Giannis; Papatheodorou, Christos Proceedings of the 14th International Conference on Virtual Systems and Multimedia, 2008. @inproceedings{Gavrilis2008, title = {Designing Interoperable Museum Information Systems}, author = {Dimitris Gavrilis and Giannis Tsakonas and Christos Papatheodorou}, url = {http://users.ionio.gr/~papatheodor/papers/VSMM08-final.pdf}, year = {2008}, date = {2008-01-01}, booktitle = {Proceedings of the 14th International Conference on Virtual Systems and Multimedia}, abstract = {Museum collections are characterized by heterogeneity, since they usually host a plethora of objects of categories, while each of them requires different description policies and metadata standards. Moreover the museum records, which keep the history and evolution of the hosted collections, request proactive curation in order to preserve this rich and diverse information. In this paper, the architecture of an innovative museum information system, as well as its implementation details is presented. In particular the requirements and the system architecture are presented along with the problems that were encountered. The main directions of the system design are (a) to increase interoperability levels and therefore assist proactive curation and (b) to enhance navigation by the usage of handheld devices. The first direction is satisfied by the design of a rich metadata schema based on the CIDOC/CRM standard. The second direction is fulfilled by the implementation of a module, which integrates the museum database with a subsystem appropriate to support user navigation into the museum floors and rooms. The module is expressed as a navigation functionality, which is accessed through handheld devices and peripherals, such as PDAs and RFID tags. The proposed system is functional and operates into the Solomos Museum, situated in Zakynthos island, Greece.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } Museum collections are characterized by heterogeneity, since they usually host a plethora of objects of categories, while each of them requires different description policies and metadata standards. Moreover the museum records, which keep the history and evolution of the hosted collections, request proactive curation in order to preserve this rich and diverse information. In this paper, the architecture of an innovative museum information system, as well as its implementation details is presented. In particular the requirements and the system architecture are presented along with the problems that were encountered. The main directions of the system design are (a) to increase interoperability levels and therefore assist proactive curation and (b) to enhance navigation by the usage of handheld devices. The first direction is satisfied by the design of a rich metadata schema based on the CIDOC/CRM standard. The second direction is fulfilled by the implementation of a module, which integrates the museum database with a subsystem appropriate to support user navigation into the museum floors and rooms. The module is expressed as a navigation functionality, which is accessed through handheld devices and peripherals, such as PDAs and RFID tags. The proposed system is functional and operates into the Solomos Museum, situated in Zakynthos island, Greece. |
Semantic integration of Collection-level information: A Crosswalk between CIDOC/CRM and Dublin Core Collections Application Profile (Paper in Conference Proceedings) Lourdi, Irini; Papatheodorou, Christos Proceedings of the annual conference of the International Documentation Committee of the International Council of Museums (CIDOC 2008), Volume: 15 2008, ISSN: 1082-9873. @inproceedings{Lourdi2008, title = {Semantic integration of Collection-level information: A Crosswalk between CIDOC/CRM and Dublin Core Collections Application Profile}, author = {Irini Lourdi and Christos Papatheodorou}, url = {http://www.ionio.gr/~papatheodor/papers/cidoc2008.pdf}, issn = {1082-9873}, year = {2008}, date = {2008-01-01}, booktitle = {Proceedings of the annual conference of the International Documentation Committee of the International Council of Museums (CIDOC 2008)}, volume = {15 }, number = {7-8}, abstract = {This paper is motivated by the demand for unified access, navigation and information retrieval from the wealth of composite, distributed and heterogeneous digital cultural collections. The last years, collection-level metadata is considered to be the key of integrated access of so many resources, since they represent the inherent and contextual characteristics of a collection. Our effort origins from the semantic interoperability perspective and considers CIDOC/CRM as the mediating schema, which integrates in an optimal way the semantics of the collection level metadata schemas and application profiles. In particular a crosswalk between Dublin Core Collections Application Profile and CIDOC/CRM is presented so that the semantics of each DCCAP element is mapped to CIDOC/CRM. The derived crosswalk is bidirectional implementing the mapping from DCCAP to CIDOC/CRM and vice versa. The paper reveals the complexity of mapping metadata schemas to ontologies and resolves particular difficulties providing a real world semantic integration case. }, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } This paper is motivated by the demand for unified access, navigation and information retrieval from the wealth of composite, distributed and heterogeneous digital cultural collections. The last years, collection-level metadata is considered to be the key of integrated access of so many resources, since they represent the inherent and contextual characteristics of a collection. Our effort origins from the semantic interoperability perspective and considers CIDOC/CRM as the mediating schema, which integrates in an optimal way the semantics of the collection level metadata schemas and application profiles. In particular a crosswalk between Dublin Core Collections Application Profile and CIDOC/CRM is presented so that the semantics of each DCCAP element is mapped to CIDOC/CRM. The derived crosswalk is bidirectional implementing the mapping from DCCAP to CIDOC/CRM and vice versa. The paper reveals the complexity of mapping metadata schemas to ontologies and resolves particular difficulties providing a real world semantic integration case. |
Preparing DARIAH (Paper in Conference Proceedings) Constantopoulos, Panos; Dallas, Costis; Doorn,; Gavrilis, Dimitris; Gros,; Stylianou, Ioannides, Marinos (Ed.): Digital Heritage - proceedings of the 14th International Conference on Virtual Systems and Multimedia, Pages: 164-166, Archaeolingua, 2008, ISBN: 9789639911017. @inproceedings{Constantopoulos2008, title = {Preparing DARIAH}, author = {Panos Constantopoulos and Costis Dallas and Doorn and Dimitris Gavrilis and Gros and Stylianou}, editor = {Marinos Ioannides}, url = {http://vsmm2008.euromed2010.eu/vsmm2008/e_Proceedings/papers/shortpaper.pdf#page=170 http://www.dcu.gr/wp-content/uploads/2016/10/Preparing-Dariah.pdf}, isbn = {9789639911017}, year = {2008}, date = {2008-01-01}, booktitle = {Digital Heritage - proceedings of the 14th International Conference on Virtual Systems and Multimedia}, pages = {164-166}, publisher = {Archaeolingua}, abstract = {In this paper, a preparatory project for an integrated European research infrastructure in the humanities is presented. This project, Preparing for the construction of the Digital Research Infrastructure for the Arts and Humanities - or Preparing DARIAH for short, is part of the ESFRI e-infrastructures programme and supports the emergence of a new collaborative framework in which researchers are able to maximise the impact of their work on the international stage and aims at providing the foundations for the timely construction of the infrastructure requisite for the arts, humanities and cultural heritage communities in the digital age. DARIAH uses an interdisciplinary approach and involves tackling a number of interrelated issues such as strategic, organisational, financial, technical and conceptual in order to facilitate long-term access to and use of all European humanities and cultural heritage information for the purposes of enhancing and expanding research, thereby increasing our knowledge and understanding of our histories, heritage, languages and cultures. The DARIAH network will act as a place where the incubation of new ideas and ways of working can be facilitated and developed, and then transitioned into established organisations thus ensuring long term sustainability and stability and the integration of these methods and techniques into everyday research practice. DARIAH will support research practitioners at all stages in the research process, and at differing levels of sophistication, from beginners through to those employing advanced techniques and methodologies.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, a preparatory project for an integrated European research infrastructure in the humanities is presented. This project, Preparing for the construction of the Digital Research Infrastructure for the Arts and Humanities - or Preparing DARIAH for short, is part of the ESFRI e-infrastructures programme and supports the emergence of a new collaborative framework in which researchers are able to maximise the impact of their work on the international stage and aims at providing the foundations for the timely construction of the infrastructure requisite for the arts, humanities and cultural heritage communities in the digital age. DARIAH uses an interdisciplinary approach and involves tackling a number of interrelated issues such as strategic, organisational, financial, technical and conceptual in order to facilitate long-term access to and use of all European humanities and cultural heritage information for the purposes of enhancing and expanding research, thereby increasing our knowledge and understanding of our histories, heritage, languages and cultures. The DARIAH network will act as a place where the incubation of new ideas and ways of working can be facilitated and developed, and then transitioned into established organisations thus ensuring long term sustainability and stability and the integration of these methods and techniques into everyday research practice. DARIAH will support research practitioners at all stages in the research process, and at differing levels of sophistication, from beginners through to those employing advanced techniques and methodologies. |
Using Handhelds to Search in Physical and Digital Information Spaces (Paper in Conference Proceedings) Veronikis, Spyros; Gavrilis, Dimitris; Zoutsou, Kyriaki; Papatheodorou, Chritos Proceedings of the 2008 The Second International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies, of the series UBICOMM '08 Pages: 225-230, IEEE Computer Society, 2008. @inproceedings{Veronikis2008, title = {Using Handhelds to Search in Physical and Digital Information Spaces}, author = {Spyros Veronikis and Dimitris Gavrilis and Kyriaki Zoutsou and Chritos Papatheodorou}, url = {http://portal.acm.org/citation.cfm?id=1448098 http://www.dcu.gr/wp-content/uploads/2016/10/Using-Handhelds-to-Search-in-Physical-and-Digital-Information-Spaces.pdf}, doi = {10.1109/UBICOMM.2008.9}, year = {2008}, date = {2008-01-01}, booktitle = {Proceedings of the 2008 The Second International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies}, pages = {225-230}, publisher = {IEEE Computer Society}, series = {UBICOMM '08}, abstract = {In recent years a wealth of information is becoming available thanks to computer and networking technology. Modern libraries incorporate in their collections information content in both physical and digital form. Meanwhile, mobile computing enables the library patrons to access that content anytime, anywhere. In this paper we present the design procedure of a new library service that supports users in seeking information in hybrid collections while being in the stacks, thus enabling content retrieval from a unified information space. Moreover an evaluation model and methodology and the results of an experimental procedure are presented aiming to assess the user satisfaction for the new service.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } In recent years a wealth of information is becoming available thanks to computer and networking technology. Modern libraries incorporate in their collections information content in both physical and digital form. Meanwhile, mobile computing enables the library patrons to access that content anytime, anywhere. In this paper we present the design procedure of a new library service that supports users in seeking information in hybrid collections while being in the stacks, thus enabling content retrieval from a unified information space. Moreover an evaluation model and methodology and the results of an experimental procedure are presented aiming to assess the user satisfaction for the new service. |
Enhancing Library Services with Web 2.0 functionalities (Paper in Conference Proceedings) Gavrilis, Dimitris; Kakali,; Papatheodorou, Christos Research and Advanced Technology for Digital Libraries. 12th European Conference, ECDL 2008, Aarhus, Denmark, September 14-19, 2008. Proceedings, Volume: 5173 of the series Lecture Notes in Computer Science Pages: 148-159, Springer Berlin Heidelberg, 2008. @inproceedings{Gavrilis2008b, title = {Enhancing Library Services with Web 2.0 functionalities}, author = {Dimitris Gavrilis and Kakali and Christos Papatheodorou}, url = {http://www.dcu.gr/wp-content/uploads/2016/10/Enhancing-Library-Services-with-Web-2.0-functionalities.pdf}, year = {2008}, date = {2008-01-01}, booktitle = {Research and Advanced Technology for Digital Libraries. 12th European Conference, ECDL 2008, Aarhus, Denmark, September 14-19, 2008. Proceedings}, volume = {5173}, pages = {148-159}, publisher = {Springer Berlin Heidelberg}, series = {Lecture Notes in Computer Science}, abstract = {In this paper, a prototype of an Online Public Access Catalog (OPAC) is presented. This new OPAC features new functionalities and utilizes web 2.0 technologies in order to deliver improved search and retrieval services. Some of these new services include social tag annotations, user opinions and ranks and tag-based similarity searches. The prototype is evaluated by a user group through questionnaires, interviews and with the system's integrated logging mechanism. The results are encouraging enough and show that Library 2.0 technologies seem to be acceptable by the majority of the users. }, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, a prototype of an Online Public Access Catalog (OPAC) is presented. This new OPAC features new functionalities and utilizes web 2.0 technologies in order to deliver improved search and retrieval services. Some of these new services include social tag annotations, user opinions and ranks and tag-based similarity searches. The prototype is evaluated by a user group through questionnaires, interviews and with the system's integrated logging mechanism. The results are encouraging enough and show that Library 2.0 technologies seem to be acceptable by the majority of the users. |
A Conversant Robotic Guide to Art Collections (Paper in Conference Proceedings) Vogiatzis, Dimitrios; Galanis, Dimitrios; Karkaletsis, Vangelis; Androutsopoulos, Ion; Spyropoulos, Constantine Proceedings of the 2nd Workshop on Language Technology for Cultural Heritage Data, Language Resources and Evaluation Conference (LREC 2008), 2008. @inproceedings{Vogiatzis2008, title = {A Conversant Robotic Guide to Art Collections}, author = {Dimitrios Vogiatzis and Dimitrios Galanis and Vangelis Karkaletsis and Ion Androutsopoulos and Constantine D. Spyropoulos }, url = {http://www.dcu.gr/wp-content/uploads/2016/10/A-Conversant-Robotic-Guide-to-Art-Collections.pdf}, year = {2008}, date = {2008-01-01}, booktitle = {Proceedings of the 2nd Workshop on Language Technology for Cultural Heritage Data, Language Resources and Evaluation Conference (LREC 2008)}, abstract = {We present the dialogue system of a robot that has been developed to serve as a museum guide. The robot interacts with human visitors in natural language, receiving instructions and providing information about the exhibits. Moreover, being mobile, it physically approaches the exhibits it provides information about. Although the robotic platform contains many modules, including navigation, speech recognition and synthesis, our focus in this paper is the dialogue system, which supports the sessions between humans and the robot, as well as the natural language generation engine, which generates the text to be spoken. Both modules are closely intertwined and depend on an ontology represented in OWL. The robot supports dialogues in both English and Greek. }, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } We present the dialogue system of a robot that has been developed to serve as a museum guide. The robot interacts with human visitors in natural language, receiving instructions and providing information about the exhibits. Moreover, being mobile, it physically approaches the exhibits it provides information about. Although the robotic platform contains many modules, including navigation, speech recognition and synthesis, our focus in this paper is the dialogue system, which supports the sessions between humans and the robot, as well as the natural language generation engine, which generates the text to be spoken. Both modules are closely intertwined and depend on an ontology represented in OWL. The robot supports dialogues in both English and Greek. |
2007 |
Integrating Dublin Core Metadata for Cultural Heritage Collections Using Ontologies (Paper in Conference Proceedings) Kakali,; Lourdi, Irini; Stasinopoulou,; Bountouri,; Papatheodorou, Christos; Doerr, Martin; Gergatsoulis, Proceedings of the 7th International Conference on Dublin Core and Metadata Applications, DC-2007, Pages: 128-139, 2007. @inproceedings{Kakali2007, title = {Integrating Dublin Core Metadata for Cultural Heritage Collections Using Ontologies}, author = {Kakali and Irini Lourdi and Stasinopoulou and Bountouri and Christos Papatheodorou and Martin Doerr and Gergatsoulis}, url = {http://eprints.rclis.org/11001/}, year = {2007}, date = {2007-08-01}, booktitle = {Proceedings of the 7th International Conference on Dublin Core and Metadata Applications, DC-2007}, pages = {128-139}, abstract = {Metadata interoperability is an active research area, especially for cultural heritage collections, which consist of heterogeneous objects described by a variety of metadata schemas. In this paper we propose an ontology-based metadata interoperability approach, which exploits, in an optimal way, the semantics of metadata schemas. In particular, we propose the use of CIDOC/CRM ontology as a mediating schema and present a methodology for mapping DC Type Vocabulary to CIDOC/CRM, demonstrating a real-world effort for ontology-based metadata integration.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } Metadata interoperability is an active research area, especially for cultural heritage collections, which consist of heterogeneous objects described by a variety of metadata schemas. In this paper we propose an ontology-based metadata interoperability approach, which exploits, in an optimal way, the semantics of metadata schemas. In particular, we propose the use of CIDOC/CRM ontology as a mediating schema and present a methodology for mapping DC Type Vocabulary to CIDOC/CRM, demonstrating a real-world effort for ontology-based metadata integration. |
Learning Textual Entailment using SVMs and String Similarity Measures (Paper in Conference Proceedings) Malakasiotis, Prodromos; Androutsopoulos, Ion Proceedings of the Workshop on Textual Entailment and Paraphrasing, 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), Pages: 42-47, 2007. @inproceedings{Malakasiotis2007, title = {Learning Textual Entailment using SVMs and String Similarity Measures}, author = {Prodromos Malakasiotis and Ion Androutsopoulos}, url = {http://www.aueb.gr/users/ion/docs/rte3_paper.pdf}, year = {2007}, date = {2007-06-01}, booktitle = {Proceedings of the Workshop on Textual Entailment and Paraphrasing, 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007)}, pages = {42-47}, abstract = {We present the system that we submitted to the 3rd Pascal Recognizing Textual Entailment Challenge. It uses four Support Vector Machines, one for each subtask of the challenge, with features that correspond to string similarity measures operating at the lexical and shallow syntactic level.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } We present the system that we submitted to the 3rd Pascal Recognizing Textual Entailment Challenge. It uses four Support Vector Machines, one for each subtask of the challenge, with features that correspond to string similarity measures operating at the lexical and shallow syntactic level. |
Ontology-based Metadata Integration in the Cultural Heritage Domain (Paper in Conference Proceedings) Stasinopoulou, Thomais; Bountouri, Lina; Lourdi, Irini; Papatheodorou, Christos; Doerr, Martin; Gergatsoulis, Manolis Proceedings of the 10th International Conference on Asian Digital Libraries, ICADL-2007, Hanoi, Vietnam, December 2007, Lecture Notes in Computer Science (LNCS), Volume: 4822/2007 Pages: 165-175, 2007. @inproceedings{Stasinopoulou2007, title = {Ontology-based Metadata Integration in the Cultural Heritage Domain}, author = {Thomais Stasinopoulou and Lina Bountouri and Irini Lourdi and Christos Papatheodorou and Martin Doerr and Manolis Gergatsoulis}, url = {http://www.springerlink.com/content/k252223528n55127/}, year = {2007}, date = {2007-01-01}, booktitle = {Proceedings of the 10th International Conference on Asian Digital Libraries, ICADL-2007, Hanoi, Vietnam, December 2007, Lecture Notes in Computer Science (LNCS)}, volume = {4822/2007}, pages = {165-175}, abstract = {In this paper, we propose an ontology-based metadata integration methodology for the cultural heritage domain. The proposed real - world approach considers an integration architecture in which CIDOC/CRM ontology acts as a mediating scheme. In this context, we present a mapping methodology from Encoded Archival Description (EAD) and Dublin Core (DC) metadata to CIDOC/CRM, and discuss the faced difficulties.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we propose an ontology-based metadata integration methodology for the cultural heritage domain. The proposed real - world approach considers an integration architecture in which CIDOC/CRM ontology acts as a mediating scheme. In this context, we present a mapping methodology from Encoded Archival Description (EAD) and Dublin Core (DC) metadata to CIDOC/CRM, and discuss the faced difficulties. |
Generating Multilingual Descriptions from Linguistically Annotated OWL Ontologies: the NaturalOWL System (Paper in Conference Proceedings) Galanis, Dimitris; Androutsopoulos, Ion Proceedings of the 11th European Workshop on Natural Language Generation (ENLG 2007), Pages: 143-146, 2007. @inproceedings{Galanis2007, title = {Generating Multilingual Descriptions from Linguistically Annotated OWL Ontologies: the NaturalOWL System}, author = {Dimitris Galanis and Ion Androutsopoulos}, url = {http://www.aueb.gr/users/ion/docs/naturalowl_enlg07.pdf}, year = {2007}, date = {2007-01-01}, booktitle = {Proceedings of the 11th European Workshop on Natural Language Generation (ENLG 2007)}, pages = {143-146}, abstract = {We introduce Naturalowl, an open-source multilingual natural language generator that produces descriptions of instances and classes, starting from a linguistically annotated ontology. The generator is heavily based on ideas from ilex and m-piro, but it is in many ways simpler and it provides full support for owl dl ontologies with rdf linguistic annotations. Naturalowl is written in Java, and it is supported by m-piro’s authoring tool, as well as an alternative plug-in for the Protégé ontology editor.}, keywords = {}, pubstate = {published}, tppubtype = {inproceedings} } We introduce Naturalowl, an open-source multilingual natural language generator that produces descriptions of instances and classes, starting from a linguistically annotated ontology. The generator is heavily based on ideas from ilex and m-piro, but it is in many ways simpler and it provides full support for owl dl ontologies with rdf linguistic annotations. Naturalowl is written in Java, and it is supported by m-piro’s authoring tool, as well as an alternative plug-in for the Protégé ontology editor. |