2022 | |
165. | R. Martoglia, L. Sala, M. Vanzini, R. Vigliermo (2021): A tool for semiautomatic cataloguing of an islamic digital library: a use case from the Digital Maktaba project. 3rd Conference on Digital Curation Technologies (Qurator 2022), Berlin, Germany, 2022. (Type: Inproceeding | Abstract | BibTeX | Tags: Cultural Heritage) @inproceedings{qurator22, title = {A tool for semiautomatic cataloguing of an islamic digital library: a use case from the Digital Maktaba project}, author = {R. Martoglia, L. Sala, M. Vanzini, R. Vigliermo}, year = {2022}, date = {2022-08-23}, booktitle = {In Proceedings of 3rd Conference on Digital Curation Technologies (Qurator 2022), Berlin, Germany}, publisher = {} }
Digital Maktaba (DM) is an interdisciplinary project to create a digital library of texts in non-Latin alphabets (Arabic, Persian, Azerbaijani). The dataset is made available by the digital library heritage of the ”La Pira” library in the history and doctrines of Islam based in Palermo, which is the hub of the Foundation for Religious Sciences (FSCIRE, Bologna). Establishing protocols for the creation, maintenance, and cataloguing of historical content in non-Latin alphabets is the long-term goal of DM. The first step of this project was to create an innovative workflow for automatic extraction of information and metadata from title pages of Arabic script texts. The OCR tool uses various recognition systems, text processing techniques and corpora in order to provide accurate extraction and metadata of document content. In this paper we address the ongoing development of this novel tool and, for the first time, we present a demo of the current version that we have designed for the extraction and cataloguing process by showing a use case on an Arabic book frontispiece. In particular, we delve into the details of the tool workflow for automatically converting and uploading PDFs from the digital library, for the automatic extraction of cataloguing metadata and the semiautomatic (at the current stage) process of cataloguing. We also
shortly discuss future prospects and the many additional features that we are planning to develop. |
164. | R. Martoglia, M. Montangero (2022): About Challenges in Data Analytics and Machine Learning for Social Good. Information, 13(8), 2022, pp. 1-20, ISSN: 2078-2489.. (Type: Journal Article | Abstract | BibTeX | Data Analytics) @article{information22, title = {About Challenges in Data Analytics and Machine Learning for Social Good}, author = {R. Martoglia, M. Montangero}, issn = {2078-2489}, volume = 13, number = 8, pages = 1-16, year = {2022}, date = {2022-07-27},0737-8831 journal = {Information}, pubstate = {published}, tppubtype = {article} }
The large number of new services and applications and, in general, all our everyday activities resolve in data mass production: all these data can become a golden source of information that might be used to improve our lives, wellness and working days. (Interpretable) Machine Learning approaches, the use of which is increasingly ubiquitous in various settings, are definitely one of the most effective tools for retrieving and obtaining essential information from data. However, many challenges arise in order to effectively exploit them. In this paper, we analyze key scenarios in which large amounts of data and machine learning techniques can be used for social good: social network analytics for enhancing cultural heritage dissemination; game analytics to foster Computational Thinking in education; medical analytics to improve the quality of life of the elderly and reduce health care expenses; exploration of work datafication potential in improving the management of human resources (HRM). For the first two of the previously mentioned scenarios, we present new results related to previously published research, framing these results in a more general discussion over challenges arising when adopting machine learning techniques for social good.
|
163. | M. Furini, L. Mariotti, R. Martoglia, M. Montangero (2022): On Designing a Time Sensitive Interaction Graph to Identify Twitter Opinion Leaders. ACM International Conference on Information Technology for Social Good (GoodIT), Limassol, Cyprus 2022. (Type: Inproceeding | Abstract | BibTeX | Tags: Social Data Analytics) @inproceedings{goodit22, title = {On Designing a Time Sensitive Interaction Graph to Identify Twitter Opinion Leaders}, author = {M. Furini, L. Mariotti, R. Martoglia, M. Montangero}, year = {2022}, date = {2022-07-11}, booktitle = {ACM International Conference on Information Technology for Social Good (GoodIT)}, publisher = {ACM} }
What happened on social media during the recent pandemic? Who was the opinion leader of the conversations? Who influenced whom? Were they medical doctors, ordinary people, scientific experts? Did health institutions play an important role in informing and updating citizens?
Identifying opinion leaders within social platforms is of particular importance and, in this paper, we introduce the idea of a time sensitive interaction graph to identify opinion leaders within Twitter conversations. To evaluate our proposal, we focused on all the tweets posted on Twitter in the period 2020-21 and we considered just the ones that were Italian-written and were related to COVID-19. After mapping these tweets into the graph, we applied the PageRank algorithm to extract the opinion leaders of these conversations. Results show that our approach is effective in identifying opinion leaders and therefore it might be used to monitor the role that specific accounts (i.e., health authorities, politicians, city administrators) have within specific conversations.
|
162. | C. Vischioni, F. Bove, M. De Chiara, F. Mandreoli, R. Martoglia, V. Pisi, G. Liti, C. Taccioli (2022): miRNAs Copy Number Variations repertoire as hallmark indicator of cancer species predisposition. Genes, 13(6), 2022, pp. , ISSN: 2073-4425.. (Type: Journal Article | Abstract | BibTeX | Tags: Genomic Data Analytics) @article{genes22, title = {miRNAs Copy Number Variations repertoire as hallmark indicator of cancer species predisposition}, author = {C. Vischioni, F. Bove, M. De Chiara, F. Mandreoli, R. Martoglia, V. Pisi, G. Liti, C. Taccioli}, issn = {2073-4425}, volume = 13, number = 6, pages = , year = {2022}, date = {2022-06-06}, journal = {Genes}, pubstate = {published}, tppubtype = {article} }
... |
161. | S. Bergamaschi, S. De Nardis, R. Martoglia, F. Ruozzi, L. Sala, M. Vanzini and R. A. Vigliermo (2022): Novel perspectives for the management of multilingual and multi-alphabetic heritages through automatic knowledge extraction: The DigitalMaktaba approach. Sensors, 22(11), 2022, pp. 1-20, ISSN: 1424-8220.. (Type: Journal Article | Abstract | BibTeX | Cultural Heritage) @article{sensors22, title = {Novel perspectives for the management of multilingual and multi-alphabetic heritages through automatic knowledge extraction: The DigitalMaktaba approach.}, author = {S. Bergamaschi, S. De Nardis, R. Martoglia, F. Ruozzi, L. Sala, M. Vanzini and R. A. Vigliermo}, issn = {1424-8220}, volume = 22, number = 11, pages = 1-20, year = {2022}, date = {2022-05-24},0737-8831 journal = {Sensors}, pubstate = {published}, tppubtype = {article} }
The linguistic and social impact of multiculturalism can no longer be neglected in any sector, bringing to the urgent need of creating systems and procedures for managing and sharing cultural heritages also in supranational and multi-literate contexts. In order to achieve this goal, text sensing appears as one of the most crucial research areas. The long-term objective of the DigitalMaktaba project, born from the interdisciplinary collaboration between computer scientists, historians, librarians, engineers and linguists, is to establish procedures for the creation, management and cataloguing of archival heritage in non-Latin alphabets.
In this paper, we discuss the currently ongoing design of an innovative workflow and tool in the area of text sensing, for the automatic extraction of knowledge and cataloguing of documents written in non-Latin languages (Arabic, Persian and Azerbaijani). The current prototype leverages different OCR, text processing and information extraction techniques in order to provide both a highly accurate extracted text and a rich metadata content (including automatically identified cataloguing metadata), overcoming typical limitations of current state of the art. The initial tests provide promising results. The paper includes a discussion of future steps (e.g., AI-based techniques further leveraging the extracted data/metadata and making the system learn from user feedback) and of the many foreseen advantages of this research, both from a technical and a broader cultural preservation and sharing point of view.
|
160. | R. Martoglia, G. Savoia (2022): Towards Multi-Model Big Data Road Traffic Forecast at Different Time Aggregations and Forecast Horizons. EAI Endorsed Transactions on Energy Web, 9(39), 2022, pp. , ISSN: 2032-944X.. (Type: Journal Article | Abstract | BibTeX | Data Analytics, ITS Data management, Data Stream) @article{ew22, title = {Towards Multi-Model Big Data Road Traffic Forecast at Different Time Aggregations and Forecast Horizons}, author = {R. Martoglia, G. Savoia}, issn = {2032-944X}, volume = 9, number = 39, pages = , year = {2022}, date = {2022-05-11},0737-8831 journal = {EAI Endorsed Transactions on Energy Web}, pubstate = {published}, tppubtype = {article} }
Due to its usefulness in various social contexts, from Intelligent Transportation Systems (ITSs) to the reduction of urban pollution, road traffic prediction represents an active research area in the scientific community, with strong potential impact on citizens’ well-being. Already considered a non-trivial problem, in many real applications an additional level of complexity is given by the large amount of data requiring Big Data domain technologies. In this paper, we present the first steps of a novel approach integrating both classic and machine learning models in the Spark-based big data architecture of the H2020 CLASS project, and we perform preliminary tests to see how usually little-considered variables (different data aggregation levels, time horizons and traffic density levels) influence the error of the different models.
|
159. | L. Bedogni, G. Cabri, R. Martoglia, F. Poggi (2022): Does the Venue of Scientific Conferences Leverage their Impact? A Large Scale study on Computer Science Conferences. Library Hi Tech, , 2022, pp. , ISSN: 0737-8831.. (Type: Journal Article | Abstract | BibTeX | Tags: Bibliometric data analytics) @article{lht22, title = {Does the Venue of Scientific Conferences Leverage their Impact? A Large Scale study on Computer Science Conferences}, author = {L. Bedogni and G. Cabri and R. Martoglia and F. Poggi}, issn = {0737-8831}, volume = , number = , pages = , year = {2022}, date = {2022-02-14},0737-8831 journal = {Library Hi Tech}, pubstate = {published}, tppubtype = {article} }
Purpose: Conferences bring scientists together and provide one of the most timely means for disseminating new ideas and cutting-edge works. The importance of conferences in many scientific areas is testified by quantitative indexes. The main goal of this paper is to investigate a novel research question: is there any correlation between the impact of scientific conferences and the venue where they took place?
Approach: To measure the impact of conferences we conducted a large scale analysis on the bibliographic data extracted from 3,838 Computer Science conference series and over 2.5 million papers spanning more than 30 years of research. To quantify the “touristicity” of a venue we exploited indexes about the attractiveness of a venue from reports of the World Economic Forum, and we have extracted 4 country-wide and 2 city-wide touristic indexes, which measure the attractiveness and the touristicity of any country or city.
Findings: We found out that the two aspects are related, and the correlation with conference impact is stronger when considering country-wide touristic indexes, achieving a correlation value of more than 0.5 when considering the average citations, and more than 0.8 when considering the total citations. Moreover the almost linear correlation with the Tourist Service Infrastructure index attests the specific importance of tourist / accommodation facilities in a given country.
Originality: This is the first attempt to focus on the relationship of venue characteristics to conference papers. The results open up new possibilities, such as supporting conference organizers in their organization efforts.
Keywords: Citation analysis; Conferences; Information science; Bibliometric analysis; Bibliometric indexes; Research impact; Correlation analysis.
|
158. | R. Cavicchioli, R. Martoglia, M. Verucchi (2022): A Novel Real-Time Edge-Cloud Big Data Management and Analytics Framework for Smart Cities. Journal of Universal Computer Science, 28(1), 2022, pp. 3-26, ISSN: 0948-695X.. (Type: Journal Article | Abstract | BibTeX | Tags: ITS Data Management) @article{jucs22, title = {A Novel Real-Time Edge-Cloud Big Data Management and Analytics Framework for Smart Cities}, author = {R. Cavicchioli and R. Martoglia and M. Verucchi}, issn = {0948-695X}, volume = 28, number = 1, pages = 3--26, year = {2022}, date = {2021-12-07}, journal = {Journal of Universal Computer Science}, pubstate = {published}, tppubtype = {article} }
Exposing city information to dynamic, distributed, powerful, scalable, and user-friendly big data systems is expected to enable the implementation of a wide range of new opportunities; however, the size, heterogeneity and geographical dispersion of data often makes it difficult to combine, analyze and consume them in a single system. In the context of the H2020 CLASS project, we describe an innovative framework aiming to facilitate the design of advanced big-data analytics workflows. The proposal covers the whole compute continuum, from edge to cloud, and relies on a well-organized distributed infrastructure exploiting: a) edge solutions with advanced computer vision technologies enabling the real-time generation of “rich” data from a vast array of sensor types; b) cloud data management techniques offering efficient storage, real-time querying and updating of the high-frequency incoming data at different granularity levels. We specifically focus on obstacle detection and tracking for edge processing, and consider a traffic density monitoring application, with hierarchical data aggregation features for cloud processing; the discussed techniques will constitute the groundwork enabling many further services. The tests are performed on the real use-case of the Modena Automotive Smart Area (MASA).
|
157. | C. Vischioni, F. Bove, F. Mandreoli, R. Martoglia, V. Pisi, C. Taccioli (2022): Visual Exploratory Data Analysis for Copy Number Variation Studies in Biomedical Research. Big Data Research (Elsevier), 27, 2022, pp. , ISSN: 2214-5796.. (Type: Journal Article | Abstract | BibTeX | Tags: Genomic Data Analytics) @article{bdr22, title = {Visual Exploratory Data Analysis for Copy Number Variation Studies in Biomedical Research}, author = {C. Vischioni, F. Bove, F. Mandreoli, R. Martoglia, V. Pisi, C. Taccioli}, issn = {2214-5796}, volume = 27, number = , pages = , year = {2022}, date = {2021-12-07}, journal = {Big Data Research}, pubstate = {published}, tppubtype = {article} }
The study of Copy Number Variations (CNVs) is recently emerging as a hot topic for biomedical cancer research. While different data sources, websites, and tools concerning genomic CNVs have been made publicly available, CNV data is still a largely unexplored source of biological information, due to the limitations of currently available analysis tools. To this respect, we propose a novel platform, named VarNuCopy, that overcomes such limitations by pursuing the core principles of Exploratory Data Analysis (EDA) in the context of Copy Number Variation (CNV) data. The platform has been made publicly available as a web application, and is, to our best knowledge, the first tool enabling visual, interactive exploration and analysis of the CNV landscape of multiple species. Through novel client and server-side optimizations inspired by scalable data science, VarNuCopy implements a comprehensive and efficient data exploration solution that empowers researchers to easily recognize complex trends and patterns within a huge amount of data concerning CNVs, and to identify new target genes that might be function as tumor suppressor and oncogenes. |
156. | F. Grandi, F. Mandreoli, R. Martoglia, W. Penzo (2022): Unleashing the Power of Querying Streaming Data in a Temporal Database World: A Relational Algebra Approach. Information Systems (Elsevier), 103, 2022, pp. , ISSN: 0306-4379.. (Type: Journal Article | Abstract | BibTeX | Tags: Data stream,Temporal data) @article{is22, title = {Unleashing the Power of Querying Streaming Data in a Temporal Database World: A Relational Algebra Approach}, author = {F. Grandi, F. Mandreoli, R. Martoglia, W. Penzo}, issn = {0306-4379}, volume = 103, number = , pages = , year = {2022}, date = {2021-07-28}, journal = {Information Systems}, pubstate = {published}, tppubtype = {article} } Modern data-intensive applications have to manage huge quantities of streaming/relational data and need advanced query capabilities involving combinations of continuous queries (CQs) and one-time queries (OTQs) also requiring the verification of complex temporal conditions.
In this paper, we go beyond the disjointed panorama of current approaches and adopt a new holistic approach to the integration of stream processing capabilities into the temporal database world based on the streaming table concept. To this end, we propose a full-fledged query interface composed of a TSQL2-like query language with an underlying algebraic framework. The algebraic framework, which is aimed at implementing the query interface on top of a working DBMS, is made up of: (a) the extended temporal algebra TA* supporting OTQs with an hybrid temporal semantics (sequenced and non-sequenced); (b) the continuous temporal algebra CTA that extends TA* with window expressions for CQ specification; (c) the translation of CTA expressions into TA* ones that can be executed by a traditional DBMS with an extended kernel. |
155. | T. Fabbri, A. Scapolan, F. Bertolotti, F. Mandreoli, R. Martoglia (2022): Work datification and digital work behaviour analysis as a source of HRM insights. Do machines dream of electric workers? Understanding the impact of digital technologies on organisations and innovation, Lecture Notes in Information Systems and Organisation (LNISO), Springer, 2022. (Type: Incollection | Abstract | BibTeX | Tags: HRM analytics) @incollection{lniso22, author = {T. Fabbri, A. Scapolan, F. Bertolotti, F. Mandreoli, R. Martoglia}, title = {Work datification and digital work behaviour analysis as a source of HRM insights}, booktitle = {Do machines dream of electric workers? Understanding the impact of digital technologies on organisations and innovation}, publisher = {Springer}, series = {Lecture Notes in Information Systems and Organisation (LNISO)}, pages = {}, year = {2022}, tppubtype = {incollection} } |
154. | M. Furini, F. Mandreoli, R. Martoglia, M. Montangero (2022): A Predictive Method to Improve the Effectiveness of Twitter Communication in a Cultural Heritage Scenario. ACM Journal on Computing and Cultural Heritage (JOCCH), 15(2), 2022, pp. , ISSN: 1556-4673. (Type: Journal Article | Abstract | BibTeX | Tags: Social Data Analytics) @article{jocch21, title = {A Predictive Method to Improve the Effectiveness of Twitter Communication in a Cultural Heritage Scenario}, author = {M. Furini, F. Mandreoli, R. Martoglia, M. Montangero}, issn = {1556-4673}, volume = 15, number = 2, pages = , year = {2022}, date = {2021-06-15}, journal = {ACM Journal on Computing and Cultural Heritage (JOCCH)}, pubstate = {published}, tppubtype = {article} } Museums are embracing social technologies in the attempt to broaden their audience and to engage people. Although social communication seems an easy task, media managers know how hard it is to reach millions of people with a simple message. Indeed, millions of posts are competing every day to get visibility in terms of likes and shares and very little research focused on museums communication to identify best practices. In this paper, we focus on Twitter and we propose a novel method that exploits interpretable machine learning techniques to: (a) predict whether a tweet will likely be appreciated by Twitter users or not; (b) present simple suggestions that will help enhancing the message and increasing the probability of its success. Using a real-world dataset of around 40,000 tweets written by 23 world famous museums, we show that our proposed method allows identifying tweet features that are more likely to influence the tweet success. |
2021 | |
153. | R. Martoglia, G. Savoia (2021): Towards Multi-Model Big Data Road Traffic Forecast at Different Time Aggregations and Forecast Horizons. 8th EAI International Conference on Mobility, IoT and Smart Cities (Mobility IoT 2021), Portugal, 2021. (Type: Inproceeding | Abstract | BibTeX | Tags: Data Analytics, ITS Data management, Data Stream) @inproceedings{mobilityiot21, title = {Towards Multi-Model Big Data Road Traffic Forecast at Different Time Aggregations and Forecast Horizons}, author = {R. Martoglia and G. Savoia}, year = {2021}, date = {2021-11-12}, booktitle = {8th EAI International Conference on Mobility, IoT and Smart Cities (Mobility IoT 2021)}, publisher = {EAI} }
Due to its usefulness in various social contexts, from Intelligent Transportation Systems (ITSs) to the reduction of urban pollution, road traffic prediction represents an active research area in the scientific community, with strong potential impact on citizens’ well-being. Already considered a non-trivial problem, in many real applications an additional level of complexity is given by the large amount of data requiring Big Data domain technologies. In this paper, we present the first steps of a novel approach integrating both classic and machine learning models in the Spark-based big data architecture of the H2020 CLASS project, and we perform preliminary tests to see how usually little-considered variables (different data aggregation levels, time horizons and traffic density levels) influence the error of the different models.
|
152. | R. Martoglia, M. Pontiroli (2021): Let the Games Speak by Themselves: Towards Game Features Discovery Through Data-Driven Analysis and Explainable AI. IEEE International Conference on Data, Information, Knowledge and Wisdom (DIKW), Haikou, China, 2021. (Type: Inproceeding | Abstract | BibTeX | Tags: Data Analytics) @inproceedings{dikw21b, title = {Let the Games Speak by Themselves: Towards Game Features Discovery Through Data-Driven Analysis and Explainable AI}, author = {R. Martoglia and M. Pontiroli}, year = {2021}, date = {2021-07-12}, booktitle = {IEEE International Conference on Data, Information, Knowledge and Wisdom (DIKW)}, publisher = {IEEE} }
The idea behind this work is to start exploring the application of data analytics and (explainable) machine learning techniques to better understand games and discover new features that will possibly help in effectively exploiting them in different socially useful domains. We prove the feasibility of the idea by: (i) collecting a large dataset of board game information; (ii) designing and testing an information processing pipeline for automatically discovering game categories and game mechanics, with some first encouraging results. In the future, we plan to further generalize this approach for different kinds of games and for discovering currently unknown but useful aspects, e.g. games or game features that could better foster Computational Thinking in education, those better suited to be applied in social distancing contexts, and so on. |
151. | S. Bergamaschi, R. Martoglia, F. Ruozzi, R. Vigliermo, S. De Nardis, L. Sala, M. Vanzini (2021): Preserving and conserving culture: first steps towards a knowledge extractor and cataloguer for multilingual and multi-alphabetic heritages. ACM International Conference on Information Technology for Social Good (GoodIT), Rome, Italy 2021. (Type: Inproceeding | Abstract | BibTeX | Tags: Cultural Heritage) @inproceedings{goodit21, title = {Preserving and conserving culture: first steps towards a knowledge extractor and cataloguer for multilingual and multi-alphabetic heritages}, author = {S. Bergamaschi, R. Martoglia, F. Ruozzi, R. Vigliermo, S. De Nardis, L. Sala, M. Vanzini }, year = {2021}, date = {2021-07-12}, booktitle = {ACM International Conference on Information Technology for Social Good (GoodIT)}, publisher = {} }
Managing and sharing cultural heritages also in supranational and multi-literate contexts is a very hot research topic. In this paper we discuss the research we are conducting in the DigitalMaktaba project, presenting the first steps for designing an innovative workflow and tool for the automatic extraction of knowledge from documents written in multiple non-Latin languages (Arabic, Persian and Azerbaijani languages). The tool leverages different OCR, text processing techniques and linguistic corpora in order to provide both a highly accurate extracted text and a rich metadata content, overcoming typical limitations of current state-of-the-art systems; this will enable in the near future the developing of an automatic cataloguer which we hope will ultimately enable a better preservation and conservation of culture in such a demanding scenario. |
150. | R. Martoglia (2021): Invited speech: Data Analytics and (Interpretable) Machine Learning for Social Good. IEEE International Conference on Data, Information, Knowledge and Wisdom (DIKW), Haikou, China, 2021. (Type: Inproceeding (Invited) | Abstract | BibTeX | Tags: Data Analytics) @inproceedings{dikw21, title = {Invited Speech: Data Analytics and (Interpretable) Machine Learning for Social Good}, author = {R. Martoglia}, year = {2021}, date = {2021-07-12}, booktitle = {IEEE International Conference on Data, Information, Knowledge and Wisdom (DIKW)}, publisher = {IEEE} }
In recent years we have been witnessing a real explosion of data in all contexts of our lives. From a research point of view, the need to process data, not only to acquire, store and perform modest operational tasks, but also to analyze and interpret them appropriately, has become more and more a shared need in an ever growing number of applications, with potential benefits not only on our work but also on our life and well being. In this talk, we consider a selection of some of the hottest / most demanding scenarios related to our everyday lives, including medical analytics to improve elder people quality of life and reduce healthcare costs, social network analysis for better cultural heritage diffusion, and exploration of the managerial potential of work datafication for improving Human Resource Management (HRM). In these contexts, we describe the recent results we obtained in our research by applying the latest data analytics techniques, including interpretable machine learning, and discuss the consequent implications and future directions. |
149. | C. Vischioni, F. Bove, F. Mandreoli, R. Martoglia, V. Pisi, C. Taccioli (2021): VarNuCopy: from Copy Number Variations to longevity and cancer species predisposition. BITS 2021 - Annual Meeting of the Bioinformatics Italian Society, 2021. (Type: Inproceeding | Abstract | BibTeX | Tags: Genomic Data Analytics) @inproceedings{bits21, title = {VarNuCopy: from Copy Number Variations to longevity and cancer species predisposition}, author = {C. Vischioni, F. Bove, F. Mandreoli, R. Martoglia, V. Pisi, C. Taccioli}, year = {2021}, date = {2021-06-30}, booktitle = {BITS 2021 - Annual Meeting of the Bioinformatics Italian Society}, publisher = {} }
VarNuCopy is the first tool to compare multiple CNVs landscape from different species, and to identify genes that appear to be linked to the genome instability. |
2020 | |
148. | G. Guaraldi, D. Ferrari, J. Milic, A. Caselgrandi, A. Malagoli, M. Orsini, F. D’Imprima, M. Mancini, M. Cesari, R. Martoglia, G. Lui, M. Bloch, C. Mussini, P. Missier, F. Mandreoli. (2020): Machine learning vs Knowledge based approach in health outcomes' prediction in HIV. 12th Italian Conference on AIDS and Antiviral Research (ICAR 2020), 2020. (Type: Poster | Abstract | BibTeX | Tags: Medical analytics) @inproceedings{ICAR20, title = {Machine learning vs Knowledge based approach in health outcomes' prediction in HIV}, author = {G. Guaraldi, D. Ferrari, J. Milic, A. Caselgrandi, A. Malagoli, M. Orsini, F. D’Imprima, M. Mancini, M. Cesari, R. Martoglia, G. Lui, M. Bloch, C. Mussini, P. Missier, F. Mandreoli.}, year = {2020}, date = {2020-12-05}, booktitle = {12th Italian Conference on AIDS and Antiviral Research (ICAR 2020)}, } - |
147. | R. Bonacin, M. Fugini, R. Martoglia, O. Nabuco, F. Sais (2020): Web2Touch 2020-21: Semantic Technologies for Smart Information Sharing and Web Collaboration. Proceedings of the 29th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2020), 2020. (Type: Inproceeding | Abstract | BibTeX | Tags: semantics) @inproceedings{WETICE20, title = {Web2Touch 2020-21: Semantic Technologies for Smart Information Sharing and Web Collaboration}, author = {R. Bonacin, M. Fugini, R. Martoglia, O. Nabuco, F. Sais}, year = {2020}, date = {2020-04-08}, booktitle = {Proceedings of the 29th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2020)}, publisher = {IEEE}, pubstate = {published}, tppubtype = {inproceedings} } This foreword introduces a summary of themes and papers of the Web2Touch (W2T) 2020-21 Track at the 29th IEEE WETICE Conference held as a virtual Conference, in October 2020. W2T 2020-21 includes six full papers and four short papers. They all address relevant issues in the field of information sharing for collaboration, including, big data analytics, knowledge engineering, linked open data, applications of smart Web technologies, and smart care. The papers address a portfolio of hot issues in research and applications of semantics, smart technologies (e.g., IoT, sensors, devices for tele-monitoring, and smart contents management) with crucial topics, such as big data analysis, knowledge representation, smart enterprise management, among the others. This track shows how cooperative technologies based on knowledge representation, intelligent tools, and enhanced Web engineering can enhance collaborative work through smart service design and delivery, so it contributes to radically change the role of the semantic Web and applications. |
146. | F. Bove, F. Mandreoli, R. Martoglia, V. Pisi, C. Taccioli, C. Vischioni (2020): VarCopy: a Visual Exploratory Data Analysis Platform for Copy Number Variation Studies. Proceedings of the 24 International Conference Information Visualisation (iV 2020), 2020. (Type: Inproceeding | Abstract | BibTeX | Tags: Genomic Data Analytics) @inproceedings{iv2020b, title = {VarCopy: a Visual Exploratory Data Analysis Platform for Copy Number Variation Studies}, author = {F. Bove, F. Mandreoli, R. Martoglia, V. Pisi, C. Taccioli, C. Vischioni}, year = {2020}, date = {2020-08-07}, booktitle = {Proceedings of the 24 International Conference Information Visualisation (iV 2020)}, publisher = {} }
The study of such a complex phenomenon as cancer, which depends on several but unexplored and unclear factors, needs new ways to visualize, analyze and combine different data both on species characteristics and genes function. To this respect, we propose a novel platform, named VarCopy, supporting visual Exploratory Data Analysis (EDA) in the context of Copy Number Variation (CNV) data. The platform will be publicly available as a web application soon, and is, to our best knowledge, the first tool allowing visual, interactive exploration and analysis of the CNV landscape of multiple species, allowing the identification of new target genes that might be useful for biomedical research. |
145. | G. Ghidoni, R. Martoglia, C. Taccioli, C. Vischioni (2020): InstaCircos: a Web Application for Fast and Interactive Circular Visualization of Large Genomic Data. Proceedings of the 24 International Conference Information Visualisation (iV 2020), 2020. (Type: Inproceeding | Abstract | BibTeX | Tags: Genomic Data Analytics) @inproceedings{iv2020a, title = {InstaCircos: a Web Application for Fast and Interactive Circular Visualization of Large Genomic Data}, author = {G. Ghidoni, R. Martoglia, C. Taccioli, C. Vischioni}, year = {2020}, date = {2020-08-07}, booktitle = {Proceedings of the 24 International Conference Information Visualisation (iV 2020)}, publisher = {} }
One of the most effective visualizations for genomics data is the circular one, supported by popular packages and visualization suites. Many tools are available, however most of them share a number of negative points including limited ease of installation/usage, slow performance and memory limitations (making them unfeasible for very large genomes such as the human one) and non interactivity. In this paper we present the ongoing work on InstaCircos, a web application born from the scientific collaboration between Big Data Analytics and Bioinformatics researchers and aiming at overcoming the available tools’ limitations. It provides advanced visualization features through an easy to use web interface and offers interactive functionalities and near real-time performances thanks to an integrated big data management back-end based on MongoDB. |
144. | R. Martoglia, M. Montangero (2020): An Intelligent Dashboard for Assisted Tweet Composition in the Cultural Heritage Area (Work-in-progress). Proceedings of 6th EAI International Conference on Smart Objects and Technologies for Social Good (GoodTechs 2020), 2020. (Type: Inproceeding | Abstract | BibTeX | Tags: Social Data Analytics) @inproceedings{goodtechs20, title = {An Intelligent Dashboard for Assisted Tweet Compositionin the Cultural Heritage Area (Work-in-progress)}, author = {R. Martoglia, M. Montangero}, year = {2020}, date = {2020-08-07}, booktitle = {Proceedings of 6th EAI International Conference on Smart Objects and Technologies for Social Good (GoodTechs 2020)}, publisher = {} }
Cultural Heritage institutions are nowadays using social media to communicate with citizens and tourists. However, providing actual effective communication is not an easy task, as every day millions of messages are posted through social media. Thus, getting visibility is not trivial. In this paper we present the architecture of a dashboard, accessible by mobile Android devices, to support museum social media managers in composing effective tweets by providing suggestions to improve message drafts. At this aim, the application exploits machine learning techniques over data related to tweets posted by museums in the past. |
143. | D. Ferrari, G. Guaraldi, F. Mandreoli, R. Martoglia, J. Milic, P. Missier (2020): Data-driven vs knowledge-driven inference of health outcomes in the ageing population: a case study. 4th International workshop on Data Analytics solutions for Real-LIfe Applications, co-located with EDBT/ICDT 2020 Joint Conference (DARLI-AP @ EDBT 2020), 2020. (Type: Inproceeding | Abstract | BibTeX | Tags: Medical analytics) @inproceedings{DARLI20, title = {Data-driven vs knowledge-driven inference of health outcomes in the ageing population: a case study}, author = {D. Ferrari, G. Guaraldi, F. Mandreoli, R. Martoglia, J. Milic, P. Missier}, year = {2020}, booktitle = {4th International workshop on Data Analytics solutions for Real-LIfe Applications, co-located with EDBT/ICDT 2020 Joint Conference (DARLI-AP @ EDBT 2020)}, } Preventive, Predictive, Personalised and Participative (P4) medicine has the potential to not only vastly improve people’s quality of life, but also to significantly reduce healthcare costs and improve its efficiency. Our research focuses on age-related diseases and explores the opportunities offered by a data-driven approach to predict wellness states of ageing individuals, in contrast to the commonly adopted knowledge-driven approach that relies on easy-to-interpret metrics manually introduced by clinical experts. This is done by means of machine learning models applied on the My Smart Age with HIV (MySAwH) dataset, which is collected through a relatively new approach especially for older HIV patient cohorts. This includes Patient Related Outcomes values from mobile smartphone apps and activity traces from commercial-grade activity loggers. Our results show better predictive performance for the data-driven approach. We also show that a post hoc interpretation method applied to the predictive models can provide intelligible explanations that enable new forms of personalised and preventive medicine. |
142. | B. Pernici, P. Plebani, M. Mecella, F. Leotta, F. Mandreoli, R. Martoglia, G. Cabri (2020): AgileChains: Agile Supply Chains Through Smart Digital Twins. 30th European Safety and Reliability Conference and the 15th Probabilistic Safety Assessment and Management Conference (ESREL), 2020. (Type: Inproceeding | Abstract | BibTeX | Tags: Industry 4.0) @inproceedings{esrel20, title = {AgileChains: Agile Supply Chains Through Smart Digital Twins}, author = {B. Pernici, P. Plebani, M. Mecella, F. Leotta, F. Mandreoli, R. Martoglia, G. Cabri}, year = {2020}, booktitle = {30th European Safety and Reliability Conference and the 15th Probabilistic Safety Assessment and Management Conference (ESREL)}, } In Industry 4.0, the digital twin paradigm is currently adopted to represent, simulate and test the behavior of one or more machines and production plants belonging to an organization. This paper introduces the AgileChains paradigm, extending the digital twin to supply chains and the dynamics of their participants. |
141. | T. Fabbri, A. Scapolan, F. Bertolotti, F. Mandreoli, R. Martoglia (2020): Work datification and digital work behaviour analysis as a source of HRM insights. XXI Workshop dei Docenti e Ricercatori di Organizzazione Aziendale (WOA), 2020. (Type: Inproceeding | Abstract | BibTeX | Tags: HRM analytics) @inproceedings{woa20, title = {Work datification and digital work behaviour analysis as a source of HRM insights}, author = {T. Fabbri, A. Scapolan, F. Bertolotti, F. Mandreoli, R. Martoglia}, year = {2020}, booktitle = {XXI Workshop dei Docenti e Ricercatori di Organizzazione Aziendale (WOA)}, } |
140. | F. Bertolotti, T. Fabbri, F. Mandreoli, R. Martoglia, A. Scapolan (2020): Work datafication and digital work behavior analysis as a source of social good. IEEE Consumer Communications & Networking Conference (CCNC), 2020. (Type: Inproceeding | Abstract | BibTeX | Tags: HRM analytics) @inproceedings{CCNC20, title = {Work datafication and digital work behavior analysis as a source of social good}, author = {F. Bertolotti, T. Fabbri, F. Mandreoli, R. Martoglia, A. Scapolan}, year = {2020}, booktitle = {IEEE Consumer Communications & Networking Conference (CCNC)}, publisher = {IEEE}, pubstate = {published}, tppubtype = {inproceedings} } The digital transformation of organizations is boosting workplace networking and collaboration while making it “observable” with unprecedented timeliness and detail. However, the informational and managerial potential of work datafication is still largely unutilized in Human Resource Management (HRM) and its social benefits, both at the individual and the organizational level, remain largely unexplored. Our research focuses on the relationship between digitally tracked work behaviors and employee attitudes and, in so doing, it explores work datafication as a source of social good. As part of a wider research program, this paper presents some data analysis we performed on a collection of Enterprise Collaboration Software (ECS) data, in search for promising correlations between behavioral and relational (digital) work patterns and employee attitudes.
To this end, we transformed the digital actions performed by 106 employees during a one year period into a graph representation to analyze data under two different points of view: the individual (behavioral) perspective, according to the user who performed the action and the action undertaken, and the social (relational) perspective, making explicit the interactions between users and the objects of their actions. Different employees’ rankings are thus derived and correlated with their attitudes. We discuss the obtained results and their benefits in terms of perspective social good for both the company and the employee. |
139. | G. Cabri, R. Martoglia (2020): A User-Aware and Semantic Approach for Enterprise Search. Natural Language Processing: Concepts, Methodologies, Tools, and Applications, pp. , 2020. (Type: Incollection | Abstract | BibTeX | Tags: Approximate search, Data Sharing) @incollection{nlp20, author = {G. Cabri, R. Martoglia}, title = {A User-Aware and Semantic Approach for Enterprise Search}, booktitle = {Natural Language Processing: Concepts, Methodologies, Tools, and Applications}, pages = {}, year = {2020}, doi = {10.4018/978-1-7998-0951-7.ch016}, pubstate = {published}, tppubtype = {incollection} } This article describes how in addition to general purposes search engines, specialized search engines have appeared and have gained their part of the market. An enterprise search engine enables the search inside the enterprise information, mainly web pages but also other kinds of documents; the search is performed by people inside the enterprise or by customers. This article proposes an enterprise search engine called AMBIT1-SE that relies on two enhancements: first, it is user-aware in the sense that it takes into consideration the profile of the users that perform the query; second, it exploits semantic techniques to consider not only exact matches but also synonyms and related terms. It performs two main activities: (1) information processing to analyse the documents and build the user profile and (2) search and retrieval to search for information that matches user's query and profile. An experimental evaluation of the proposed approach is performed on different real websites, showing its benefits over other well-established approaches. |
2019 | |
138. | G. Guaraldi, M. Orsini, A. Caselgrandi, A. Malagoli, F. D’Imprima, J. Milic, F. Ghinelli, R. Martoglia, F. Mandreoli, D. Ferrari, G. Liu, M. Bloch. (2019): Intrinsic capacity but not frailty predicts functional status in PLWH: a multi-centre prospective study. 10th International Workshop on HIV & Aging, 2019. (Type: Inproceeding | Abstract | BibTeX | Tags: Medical analytics) @inproceedings{HIVA19, title = {Intrinsic capacity but not frailty predicts functional status in PLWH: a multi-centre prospective study}, author = {G. Guaraldi, M. Orsini, A. Caselgrandi, A. Malagoli, F. D’Imprima, J. Milic, F. Ghinelli, R. Martoglia, F. Mandreoli, D. Ferrari, G. Liu, M. Bloch.}, year = {2019}, date = {2019-08-05}, booktitle = {10th International Workshop on HIV & Aging}, } My Smart Age with HIV (MySAwH) is a multi-centre prospective ongoing study with the intention of empowering people living with HIV (PLWH) 50+ years to develop healthy lifestyles. MySAwH is based on collection of physical function data and patient-related outcomes through a dedicated smart-phone app (MySAwH App). Our objective was to describe health changes assessed with frailty index (FI), collected by health professionals, and with a self-generated health measure called intrinsic capacity (IC) index which explores 5 different health domains: locomotion, vitality, sensory, cognition and psychosocial factors. FI and IC were used to predict physical performance at follow up. |
137. | G. Guaraldi, M. Orsini, A. Caselgrandi, A. Malagoli, F. D’Imprima, J. Milic, F. Ghinelli, R. Martoglia, F. Mandreoli, D. Ferrari, G. Liu, M. Bloch. (2019): Fitness tracking wearable devices and a dedicated smart phone app (MySAwH App) to predict quality of life in PLWH: a multi-centre prospective study. 17th European AIDS Conference (EACS), November 6-9, Basel, Switzerland, 2019. (Type: Inproceeding | Abstract | BibTeX | Tags: Medical analytics) @inproceedings{EACS19, title = {Fitness tracking wearable devices and a dedicated smart phone app (MySAwH App) to predict quality of life in PLWH: a multi-centre prospective study.}, author = {G. Guaraldi, M. Orsini, A. Caselgrandi, A. Malagoli, F. D’Imprima, J. Milic, F. Ghinelli, R. Martoglia, F. Mandreoli, D. Ferrari, G. Liu, M. Bloch.}, year = {2019}, date = {2019-08-05}, booktitle = {17th European AIDS Conference (EACS), November 6-9, Basel, Switzerland, 2019.} } My Smart Age with HIV (MySAwH) is a multi-centre prospective ongoing study based on collection of physical function data and patient-related outcomes through a dedicated smart-phone app (MySAwH App). Our objective was to describe health changes assessed with frailty index (FI), collected by health professionals, and a health measure called intrinsic capacity (IC) index which explores 5 different health domains: locomotion, vitality, sensory, cognition, psychosocial. FI and IC were used to predict quality of life (QOL) and health score (HS) at follow-up. |
136. | T. Fabbri, F. Mandreoli, R. Martoglia, A. Scapolan (2019): Employee attitudes and (digital) collaboration data: A preliminary analysis in the HRM field. International Workshop on Social Media Sensing (SMS'19 @ IEEE ICCCN), 2019. (Type: Inproceeding | Abstract | BibTeX | Tags: HRM analytics) @inproceedings{SMS19, title = {Employee attitudes and (digital) collaboration data: A preliminary analysis in the HRM field}, author = {T. Fabbri, F. Mandreoli, R. Martoglia, A. Scapolan}, year = {2019}, date = {2019-08-05}, booktitle = {International Workshop on Social Media Sensing (SMS'19 @ IEEE ICCCN)}, publisher = {IEEE}, pubstate = {published}, tppubtype = {inproceedings} } The digital transformation of organizations is making workplace collaboration more and more powerful and work always “observable”; however, the informational and managerial potential of the generated data is still largely unutilized in Human Resource Management (HRM). Our research, conducted in collaboration with business engineers and economists, aims at exploring the relationship between digital work behaviors and employee attitudes. This paper is a work-in-progress contribution that presents a preliminary phase of data analysis we performed on a collection of Enterprise Collaboration Software (ECS) data. In the exploratory data analysis step, we analyze data in their original table format and elaborate it according to the user who performed the action and the performed action. Then, we move to a graph representation in order to make explicit the interaction between users and the objects of their actions. Finally, we introduce the concept of employee-attitude-oriented pattern as a mean to derive significant views over the overall graph and discuss Social Network Analysis (SNA) approaches that can be exploited for our purposes. |
135. | R. Bonacin, M. Fugini, O. Nabuco, R. Martoglia, F. Sais (2019): Web2Touch 2019: Semantic Technologies in Smart Information Sharing and Web Collaboration. Proceedings of the 28th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2019), 2019. (Type: Inproceeding | Abstract | BibTeX | Tags: semantics) @inproceedings{WETICE19, title = {Web2Touch 2019: Semantic Technologies in Smart Information Sharing and Web Collaboration}, author = {R. Bonacin, M. Fugini, O. Nabuco, R. Martoglia, F. Sais}, year = {2019}, date = {2019-04-08}, booktitle = {Proceedings of the 28th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2019)}, publisher = {IEEE}, pubstate = {published}, tppubtype = {inproceedings} } This foreword introduces a summary of themes and papers of the Web2Touch (W2T) 2019 Track at the 28th IEEE WETICE Conference held in Capri, June 2019. W2T 2019 includes ten full papers and one short paper. They all address relevant issues in the field of information sharing for collaboration, including, big data analytics, knowledge engineering, linked open data, applications of smart Web technologies, and smart care. The papers are a portfolio of hot issues in research and applications of semantics, smart technologies (e.g., IoT, sensors, devices for tele-monitoring, and smart contents management) with crucial topics, such as big data analysis, knowledge representation, smart enterprise management, among the others. This track shows how cooperative technologies based on knowledge representation, intelligent tools, and enhanced Web engineering can enhance collaborative work through smart service design and delivery, so it contributes to radically change the role of the semantic Web and applications. |
134. | G. Guaraldi, F. Mandreoli, R. Martoglia (2018): Intrinsic Capacity Index in Older Adults living with HIV: data analytics from a prospective clinical trial to standardise a healthy aging tool . 3rd Annual MAQC Society Conference - Reproducibility for Artificial Intelligence in Medicine (MAQC 2019), 2019. (Type: Inproceeding | Abstract | BibTeX | Tags: Medical Data Analytics) @article{maqc19, title = {Intrinsic Capacity Index in Older Adults living with HIV: data analytics from a prospective clinical trial to standardise a healthy aging tool}, author = {G. Guaraldi, F. Mandreoli, R. Martoglia}, year = {2019}, date = {2019-04-08}, booktitle = {3rd Annual MAQC Society Conference - Reproducibility for Artificial Intelligence in Medicine (MAQC 2019)}, publisher = {}, The objective of this study is to follow a data-driven approach to select variables and standardise an Intrinsic Capacity Index (ICI) in relation to age and relevant health outcomes including frailty and HIV status. We will investigate to what extent deep learning and machine learning techniques can be exploited to predict adverse health outcomes and patient’s status from the selected ICI variables. |
133. | F. Grandi, F. Mandreoli, R. Martoglia (2019): Towards Patient-centric Healthcare: Multi-version Ontology-based Personalization of Clinical Guidelines. Semantic Web Science and Real-World
Applications, pp. , 2019. (Type: Incollection | Abstract | BibTeX | Tags: Approximate search, Data Sharing) @incollection{db2019, author = {F. Grandi, F. Mandreoli, R. Martoglia}, title = {Towards Patient-centric Healthcare: Multi-version Ontology-based Personalization of Clinical Guidelines}, booktitle = {Semantic Web Science and Real-World Applications}, pages = {}, year = {2019}, url = {}, doi = {}, pubstate = {published}, tppubtype = {incollection} } Retrieving personalized care plans from a guideline repository is an ever-increasing need in the medical world, not only for physicians but also for empowered patients. In this chapter, we continue our long-lasting research on ontology-based personalized access to very large collections of multi-version documents by addressing a novel challenge: dealing with multi-version clinical guidelines but also with a multi-version ontology used to support personalized access to them. Efficiency is ensured by a newly introduced annotation scheme for guidelines and solutions to cope with the evolution of ontology structure. The tests performed on a prototype implementation confirm the goodness of the approach. Finally, the chapter proposes an exhaustive analysis of the state of the art in this field and, in the final part, a discussion where we expand our vision to related research themes and possible further developments of our work. |
2018 | |
132. | M. Furini, F. Mandreoli, R. Martoglia, M. Montangero (2018): Towards Tweet Content Suggestions for Museum Media Managers. Proceedings of 4th EAI International Conference on Smart Objects and Technologies for Social Good (GoodTechs 2018), 2018. (Type: Inproceeding | Abstract | BibTeX | Tags: Social Data Analytics) @article{goodtechs18, title = {Towards Tweet Content Suggestions for Museum Media Managers}, author = {M. Furini, F. Mandreoli, R. Martoglia, M. Montangero}, year = {2018}, date = {2018-11-23}, booktitle = {Proceedings of 4th EAI International Conference on Smart Objects and Technologies for Social Good (GoodTechs 2018)}, publisher = {}, Cultural Heritage institutions are embracing social technologies in the attempt to provide an effective communication towards citizens. Although it seems easy to reach millions of people with a simple message posted on social media platforms, media managers know that practice is different from theory. Millions of posts are competing every day to get visibility in terms of likes and retweets. The way text, images, hashtags and links are combined together is critical for the visibility of a post. In this paper, we propose to exploit machine learning techniques in order to predict whether a tweet will likely be appreciated by Twitter users or not. Through an experimental assessment, we show that it is possible to provide insights about the tweet features that will likely influence its reception/recommendation among readers. The preliminary tests, performed on a real-world dataset of 19,527 museum tweets, show promising accuracy results. |
131. | R. Bonacin, M. Fugini, O. Nabuco, R. Martoglia (2018): Web2Touch 2018: Semantic Technologies in Smart Information Sharing and Web Collaboration. Proceedings of the 27th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2018), 2018. (Type: Inproceeding | Abstract | BibTeX | Tags: semantics) @inproceedings{WETICE18, title = {Web2Touch 2018: Semantic Technologies in Smart Information Sharing and Web Collaboration}, author = {R. Bonacin, M. Fugini, O. Nabuco, R. Martoglia}, year = {2018}, date = {2018-11-01}, booktitle = {Proceedings of the 27th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2018)}, publisher = {IEEE}, pubstate = {published}, tppubtype = {inproceedings} } We present Web2Touch 2018, one of the Tracks at the 27th IEEE WETICE Conference. Web2Touch 2018 includes five full papers and one short paper tackling very hot issues in information sharing and collaboration, including, among others, big data analytics, development of virtual agents and assistants, privacy and security analysis and evaluation. Papers come from areas such as knowledge engineering, linked data, big data, security, safety, and web science. The overall focus is on how research on semantics coupled with crucial topics such as big data analysis, privacy, knowledge representation, and enterprise contents management, among the others, can improve services and collaboration and push forward new ways of interpreting the role of the web. |
130. | R. Martoglia (2018): SocialGQ: Towards Semantically Approximated and User-aware Querying of Social-Graph Data
Proceedings of the 30th International Conference on Software Engineering and Knowledge Engineering (SEKE 2018), pp. , KSI Research Inc. and Knowledge Systems Institute Graduate School 2018, 2018, ISBN: . (Type: Inproceeding | Abstract | BibTeX | Tags: socialgq project, Approximate search) @inproceedings{seke2018, title = {SocialGQ: Towards Semantically Approximated and User-aware Querying of Social-Graph Data}, author = {R. Martoglia}, isbn = {1-891706-35-7}, year = {2018}, date = {2018-05-22}, urldate = {2018-05-22}, booktitle = {Proceedings of the 30th International Conference on Software Engineering and Knowledge Engineering (SEKE 2018)}, pages = {}, publisher = {KSI Research Inc. and Knowledge Systems Institute Graduate School 2018}, abstract = {The proliferation of social and collaborative sites makes users increasingly active in the generation of social-graph data; however, such sea of data often hinders them from finding the information they need. In this paper, we present SocialGQ (“Social-Graph Querying”), a novel approach for the effective and efficient querying of social-graph data overcoming the limitations of typical search approaches proposed in the literature. SocialGQ allows users to compose complex queries in a simple way, and is able to retrieve useful knowledge (top-k answers) by jointly exploiting: (a) the structure of the graph, semantically approximating the user’s requests with meaningful answers; (b) the unstructured textual resources of the graph; (c) its social and user-aware dimension. An experimental evaluation comparing SocialGQ to leading approaches shows strong gains on a real social-graph data scenario.}, keywords = {socialgq project, Approximate search}, pubstate = {published}, tppubtype = {inproceedings} } The proliferation of social and collaborative sites makes users increasingly active in the generation of social-graph data; however, such sea of data often hinders them from finding the information they need. In this paper, we present SocialGQ (“Social-Graph Querying”), a novel approach for the effective and efficient querying of social-graph data overcoming the limitations of typical search approaches proposed in the literature. SocialGQ allows users to compose complex queries in a simple way, and is able to retrieve useful knowledge (top-k answers) by jointly exploiting: (a) the structure of the graph, semantically approximating the user’s requests with meaningful answers; (b) the unstructured textual resources of the graph; (c) its social and user-aware dimension. An experimental evaluation comparing SocialGQ to leading approaches shows strong gains on a real social-graph data scenario. |
129. | G. Cabri, R. Martoglia (2018): A User-aware and Semantic Approach for Enterprise Search. International Journal on Semantic Web and Information Systems, 14(4), 2018, pp. 129-146, ISSN: 1552-6283. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search) @article{ijswis18, title = {A User-aware and Semantic Approach for Enterprise Search}, author = {G. Cabri, R. Martoglia}, issn = {1552-6283}, volume = 14, number = 4, pages = 129-146, year = {2018}, date = {2018-03-06}, journal = {International Journal on Semantic Web and Information Systems}, pubstate = {published}, tppubtype = {article} } In addition to general purposes search engines, specialized search engines have appeared and have gained their part of the market. An enterprise search engine enables the search inside the enterprise information, mainly web pages but also other kinds of documents; the search is performed by people inside the enterprise or by customers. This paper proposes an enterprise search engine called AMBIT-SE that relies on two enhancements: first, it is user-aware in the sense that it takes into consideration the profile of the users that perform the query; second, it exploits semantic techniques to consider not only exact matches but also synonyms and related terms. It performs two main activities: (i) information processing to analyse the documents and build the user profile and (ii) search and retrieval to search for information that matches user's query and profile. An experimental evaluation of the proposed approach is performed on different real websites, showing its benefits over other well-established approaches. |
128. | M. Furini, F. Mandreoli, R. Martoglia and M. Montangero (2018): 5 Steps to Make Art Museums Tweet Influentially. 3rd International Workshop on Social Sensing, (SocialSens 2018), 2018, pp. -. (Type: Inproceedings | Abstract | BibTeX | Tags: Social Data Analytics) @inproceedings{SOCIALSENS18, author = {M. Furini, F. Mandreoli, R. Martoglia and M. Montangero}, title = {5 Steps to Make Art Museums Tweet Influentially}, booktitle = {3rd International Workshop on Social Sensing}, {SocialSens} 2018, April 17, 2018, Orlando, USA}, pages = {-}, year = {2018}, url = {}, doi = {}, } A growing number of museums has started using social networks as different forms of engagement that can act outside museum architectural bounds. Specifically, museum leaders are praising Twitter as a necessary tool to any online programming or presence in museums today. Nevertheless, using Twitter in a satisfactory way so to increase museums’ influence is not an easy task and there has been a gap between its usage and the possibilities it represents. In this paper, we propose an easily understandable framework to analyze the key content factors in museum conversations, including novel formulas for the evaluation of tweets and Twitter accounts influence. We apply the framework to a dataset of 100,000 messages related to 26 museum accounts to understand which museum is more influential in writing tweets, and which features have more impact on the influence of a tweet. Finally, we propose 5 key steps that museums can perform in order to write more influential tweets. |
127. | S. Bergamaschi, D. Beneventano, F. Mandreoli, R. Martoglia, F. Guerra, M. Orsini, L. Po, M. Vincini, G. Simonini, S. Zhu, L. Gagliardelli, L. Magnotta (2018): From Data Integration to Big Data Integration. A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years, pp. 43-59, 2018. (Type: Incollection | Abstract | BibTeX | Tags: Approximate search, Data Sharing) @incollection{db2018, author = {S. Bergamaschi, D. Beneventano, F. Mandreoli, R. Martoglia, F. Guerra, M. Orsini, L. Po, M. Vincini, G. Simonini, S. Zhu, L. Gagliardelli, L. Magnotta}, title = {From Data Integration to Big Data Integration}, booktitle = {A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years.}, pages = {43--59}, year = {2018}, url = {https://doi.org/10.1007/978-3-319-61893-7_3}, doi = {10.1007/978-3-319-61893-7_3}, pubstate = {published}, tppubtype = {incollection} } The Database Group (DBGroup, www.dbgroup.unimore.it) and Information System Group (ISGroup, www.isgroup.unimore.it) research activities have been mainly devoted to the Data Integration Research Area. The DBGroup designed and developed the MOMIS data integration system, giving raise to a successful innovative enterprise DataRiver (www.datariver.it), distributing MOMIS as open source. MOMIS provides an integrated access to structured and semistructured data sources and allows a user to pose a single query and to receive a single unified answer. Description Logics, Automatic Annotation of schemata plus clustering techniques constitute the theoretical framework. In the context of data integration, the ISGroup addressed problems related to the management and querying of heterogeneous data sources in large-scale and dynamic scenarios. The reference architectures are the Peer Data Management Systems and its evolutions toward dataspaces. In these contexts, the ISGroup proposed and evaluated effective and efficient mechanisms for network creation with limited information loss and solutions for mapping management query reformulation and processing and query routing. The main issues of data integration have been faced: automatic annotation, mapping discovery, global query processing, provenance, multidimensional Information integration, keyword search, within European and national projects. With the incoming new requirements of integrating open linked data, textual and multimedia data in a big data scenario, the research has been devoted to the Big Data Integration Research Area. In particular, the most relevant achieved research results are: a scalable entity resolution method, a scalable join operator and a tool, LODEX, for automatically extracting metadata from Linked Open Data (LOD) resources and for visual querying formulation on LOD resources. Moreover, in collaboration with DATARIVER, Data Integration was successfully applied to smart e-health. |
126. | A. Bujari, M. Furini, F. Mandreoli, R. Martoglia, M. Montangero, D. Ronzani (2018): Standards, Security and Business Models: Key Challenges for the IoT Scenario. Mobile Networks and Applications, 23(1), pp. 147-154, 2018, ISSN: 1383-469X. (Type: Journal Article | Abstract | BibTeX | Tags: IoT) @article{monet17, title = {Standards, Security and Business Models: Key Challenges for the IoT Scenario}, author = {A. Bujari, M. Furini, F. Mandreoli, R. Martoglia, M. Montangero, D. Ronzani}, issn = {1383-469X}, volume = 23, number = 1, pages = 147-154, year = {2018}, date = {2017-01-01}, journal = {Mobile Networks and Applications}, abstract = {The number of physical objects connected to the Internet constantly grows and a common thought says the IoT scenario will change the way we live and work. Since IoT technologies have the potential to be pervasive in almost every aspect of a human life, in this paper, we deeply analyze the IoT scenario. First, we describe IoT in simple terms and then we investigate what current technologies can achieve. Our analysis shows four major issues that may limit the use of IoT (i.e., interoperability, security, privacy, and business models) and it highlights possible solutions to solve these problems. Finally, we provide a simulation analysis that emphasizes issues and suggests practical research directions.}, keywords = {IoT}, pubstate = {published}, tppubtype = {article} } The number of physical objects connected to the Internet constantly grows and a common thought says the IoT scenario will change the way we live and work. Since IoT technologies have the potential to be pervasive in almost every aspect of a human life, in this paper, we deeply analyze the IoT scenario. First, we describe IoT in simple terms and then we investigate what current technologies can achieve. Our analysis shows four major issues that may limit the use of IoT (i.e., interoperability, security, privacy, and business models) and it highlights possible solutions to solve these problems. Finally, we provide a simulation analysis that emphasizes issues and suggests practical research directions. |
2017 | |
125. | F. Grandi, F. Mandreoli, R. Martoglia, W. Penzo (2017): A Relational Algebra for Streaming Tables Living in a Temporal Database World. 24th International Symposium on Temporal Representation and Reasoning, (TIME 2017), 2017, pp. 15:1-15:17. (Type: Inproceeding | Abstract | BibTeX | Tags: Data Stream) @inproceedings{TIME17, author = {F. Grandi, F. Mandreoli, R. Martoglia, W. Penzo}, title = {A Relational Algebra for Streaming Tables Living in a Temporal Database World}, booktitle = {24th International Symposium on Temporal Representation and Reasoning, {TIME} 2017, October 16-18, 2017, Mons, Belgium}, pages = {15:1--15:17}, year = {2017}, url = {https://doi.org/10.4230/LIPIcs.TIME.2017.15}, doi = {10.4230/LIPIcs.TIME.2017.15}, } The recently introduced streaming table concept, a fully native representation of streaming data inside a DBMS, enabled modern data-intensive applications with one-time queries (OTQs) and continuous queries (CQs) capabilities on both streaming and standard relational tables. In this paper, we fully acknowledge the temporal nature of streaming tables and we propose to go one step further and integrate them in a temporal DBMS context, where time management is native. Our aim is to break the traditional barrier between the streaming and the temporal worlds, offering complete interoperability between streams and temporal data. To this end, we present a continuous temporal algebra supporting both OTQs and CQs seamlessly on streaming, standard and temporal relational tables. We further show how the transition from continuous to one-time semantics can be managed by defining suitable translation rules, which can also be used as a basis for the implementation of the proposed continuous algebra in a temporal DBMS. |
124. | O. Nabuco, R. Bonacin, R. Martoglia (2017): Web2Touch 2017: Semantic Technologies in Smart Information Sharing and Web Collaboration. Proceedings of the 26th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2017), 2017. (Type: Inproceeding | Abstract | BibTeX | Tags: ambit project) @inproceedings{WETICE17, title = {Web2Touch 2017: Semantic Technologies in Smart Information Sharing and Web Collaboration}, author = {O. Nabuco, R. Bonacin, M. Fugini, R. Martoglia}, year = {2017}, date = {2017-11-01}, booktitle = {Proceedings of the 26th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2017)}, publisher = {IEEE}, pubstate = {published}, tppubtype = {inproceedings} } This report presents Web2Touch 2017, a Track at the 26th IEEE WETICE Conference. This year Web2Touch completed 10 editions focusing on scientific and practical works about semantic web as a support for collaborative platforms in their need for sharing knowledge. Web2Touch is an open forum for studies in multiple application domains including, for example, web science, health systems, collaborative learning, smart cooperative systems, and web collaboration and communication in general. Web2Touch 2017 includes five full papers and one short paper. The overall focus of the contributions is on the research of how semantics can improve information sharing, services, and collaboration on "the new web". |
123. | L. Carafoli, F. Mandreoli, R. Martoglia, W. Penzo (2017): Streaming Tables: Native Support to Streaming Data in DBMSs. IEEE Transactions on Systems, Man and Cybernetics: Systems, 47(10), 2017, pp. 2768-2782, ISSN: 1383-469X. (Type: Journal Article | Abstract | BibTeX | Tags: Data Stream) @article{tsmcs17, title = {Streaming Tables: Native Support to Streaming Data in DBMSs}, author = {L. Carafoli, F. Mandreoli, R. Martoglia, W. Penzo}, issn = {2168-2216}, volume = 47, number = 10, pages = 2768-2782, year = {2017}, date = {2017-09-18}, journal = {IEEE Transactions on Systems, Man and Cybernetics: Systems}, abstract = {Data stream management systems (DSMSs) are conceived for running continuous queries (CQs) on the most recently streamed data. This model does not completely fit the needs of several modern data-intensive applications that require to manage recent/historical/static data and execute both CQs and OTQs joining such data. In order to cope with these new needs, some DSMSs have moved toward the integration of database management systems (DBMSs) functionalities to augment their capabilities. In this paper we adopt the opposite perspective and we lay the groundwork for extending DBMSs to natively support streaming facilities. To this end, we introduce a new kind of table, the streaming table, as a persistent structure where streaming data enters and remains stored for a long period, ideally forever. Streaming tables feature a novel access paradigm: continuous writes and one-time as well as continuous reads. We present a streaming table implementation and two novel types of indices that efficiently support both update and scan high rates. A detailed experimental evaluation shows the effectiveness of the proposed technology.}, keywords = {Data Stream}, pubstate = {published}, tppubtype = {article} } Data stream management systems (DSMSs) are conceived for running continuous queries (CQs) on the most recently streamed data. This model does not completely fit the needs of several modern data-intensive applications that require to manage recent/historical/static data and execute both CQs and OTQs joining such data. In order to cope with these new needs, some DSMSs have moved toward the integration of database management systems (DBMSs) functionalities to augment their capabilities. In this paper we adopt the opposite perspective and we lay the groundwork for extending DBMSs to natively support streaming facilities. To this end, we introduce a new kind of table, the streaming table, as a persistent structure where streaming data enters and remains stored for a long period, ideally forever. Streaming tables feature a novel access paradigm: continuous writes and one-time as well as continuous reads. We present a streaming table implementation and two novel types of indices that efficiently support both update and scan high rates. A detailed experimental evaluation shows the effectiveness of the proposed technology. |
122. | M. Furini, F. Mandreoli, R. Martoglia, M. Montangero (2017): The Use of Hashtags in the Promotion of Art Exhibitions. Proceedings of 13th Italian Research Conference on Digital Libraries (IRCDL 2017), pp. 187-198, 2017. (Type: Inproceeding | Abstract | BibTeX | Tags: Social Data Analytics) @article{ircdl17, title = {The Use of Hashtags in the Promotion of Art Exhibitions}, author = {M. Furini, F. Mandreoli, R. Martoglia, M. Montangero}, year = {2017}, pages = {187-198}, date = {2017-01-17}, booktitle = {Proceedings of 13th Italian Research Conference on Digital Libraries (IRCDL 2017)}, publisher = {Springer Verlag}, abstract = {Hashtags are increasingly used to promote, foster and group conversations around specific topics. For example, the entertainment industry widely uses hashtags to increase interest around their products. In this paper, we analyze whether hashtags are effective in a niche scenario like the art exhibitions. The obtained results show very different behaviors and confused strategies: from museums that do not consider hashtags at all, to museums that create official hastags, but hardly mention them; from museums that create multiple hashtags for the same exhibition, to those that are very confused about hashtag usage. Furthermore, we discovered an interesting case, where a smart usage of hashtags stimulated the interest around art. Finally, we highlight few practical guidelines with behaviors to follow and to avoid; the guidelines might help promoting art exhibitions.}, } Hashtags are increasingly used to promote, foster and group conversations around specific topics. For example, the entertainment industry widely uses hashtags to increase interest around their products. In this paper, we analyze whether hashtags are effective in a niche scenario like the art exhibitions. The obtained results show very different behaviors and confused strategies: from museums that do not consider hashtags at all, to museums that create official hastags, but hardly mention them; from museums that create multiple hashtags for the same exhibition, to those that are very confused about hashtag usage. Furthermore, we discovered an interesting case, where a smart usage of hashtags stimulated the interest around art. Finally, we highlight few practical guidelines with behaviors to follow and to avoid; the guidelines might help promoting art exhibitions. |
121. | F. Grandi, F. Mandreoli, R. Martoglia (2017): Multi-version Ontology-based Personalization of Clinical Guidelines for Patient-centric Healthcare. International Journal on Semantic Web and Information Systems, 13 (1), 2017, ISSN: 1552-6283. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search) @article{ijswis17, title = {Multi-version Ontology-based Personalization of Clinical Guidelines for Patient-centric Healthcare}, author = {F. Grandi, F. Mandreoli, R. Martoglia}, issn = {1552-6283}, volume = {13}, number = {1}, year = {2017}, date = {2017-01-01}, journal = {International Journal on Semantic Web and Information Systems}, abstract = {When dealing with a specific patient case, physicians are often interested in retrieving a personalized version of a clinical guideline, that is a version tailored to their use needs. In a patient-centric scenario, empowered patients make up another class of users interested in retrieving personalized care plans from a guideline repository. In our previous work, we proposed techniques to efficiently provide ontology-based personalized access to very large collections of multi-version clinical guidelines. In this paper, we address the problem of also dealing with a multi-version ontology used to support personalized access to clinical guidelines. Our approach allows the semantic indexing of guideline contents with respect to multi-version ontology classes and exploits the IS-A relationship among such classes for granting personalized access. Efficiency is ensured by a newly introduced annotation scheme for guidelines and solutions to cope with the evolution of ontology structure. The tests performed on a prototype implementation confirm the goodness of the approach.}, keywords = {Approximate search}, pubstate = {published}, tppubtype = {article} } When dealing with a specific patient case, physicians are often interested in retrieving a personalized version of a clinical guideline, that is a version tailored to their use needs. In a patient-centric scenario, empowered patients make up another class of users interested in retrieving personalized care plans from a guideline repository. In our previous work, we proposed techniques to efficiently provide ontology-based personalized access to very large collections of multi-version clinical guidelines. In this paper, we address the problem of also dealing with a multi-version ontology used to support personalized access to clinical guidelines. Our approach allows the semantic indexing of guideline contents with respect to multi-version ontology classes and exploits the IS-A relationship among such classes for granting personalized access. Efficiency is ensured by a newly introduced annotation scheme for guidelines and solutions to cope with the evolution of ontology structure. The tests performed on a prototype implementation confirm the goodness of the approach. |
2016 | |
120. | M. Furini, F. Mandreoli, R. Martoglia, M. Montangero (2016): IoT: Science Fiction or Real Revolution?. Proceedings of 2nd EAI International Conference on Smart Objects and Technologies for Social Good (GoodTechs 2016), 2016. (Type: Inproceeding | Abstract | BibTeX | Tags: ambit project) @article{goodtechs16, title = {IoT: Science Fiction or Real Revolution?}, author = {M. Furini, F. Mandreoli, R. Martoglia, M. Montangero}, year = {2016}, date = {2016-11-23}, booktitle = {Proceedings of 2nd EAI International Conference on Smart Objects and Technologies for Social Good (GoodTechs 2016)}, publisher = {}, abstract = {It’s been many years since media began talking about the wonders of the IoT scenario, where a smart fridge checks the milk expiration date and automatically compiles the shopping list, but in the real life how many people have this smart fridge in the kitchen? Yet the interest around the IoT scenario is growing every day, so in this paper we try to figure out if IoT is science fiction or a real revolution. In particular, we describe in simple terms the IoT scenario, what can be done with current technologies, what are the main obstacles that limit the success and the wide use of IoT and we highlight directions that can make IoT a true reality.}, } The large availability of services, provided by different means such as the Web, smartphone apps, and wearable devices, provides users a valuable support for their everyday activities, but at the same time introduces the need for a tailored choice and exploitation of them. Several approaches have been proposed that take into account users’ preferences, but a comprehensive user-aware approach is still missing.
In this paper we propose a middleware for composing and exploiting services that exhibits some key features: (i) it considers the profile of users that exploit the service to choose appropriate services, (ii) it exploits semantic similarity techniques to make the choice more effective, and (iii) it enables the collaboration among users. By means of a case study we present a possible scenario that can take advantage of our middleware, and show how it can be exploited. |
119. | O. Nabuco, R. Bonacin, M. Fugini, R. Martoglia (2016): Web2Touch 2016: Evolution and security of collaborative web knowledge. Proceedings of the 25th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2016), 2016. (Type: Inproceeding | Abstract | BibTeX | Tags: ambit project) @article{WETICE16b, title = {Web2Touch 2016: Evolution and security of collaborative web knowledge}, author = {O. Nabuco, R. Bonacin, M. Fugini, R. Martoglia}, year = {2016}, date = {2016-04-01}, booktitle = {Proceedings of the 25th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2016)}, publisher = {IEEE}, abstract = {This report introduces the Web2Touch 2016, a Track at the 25th IEEE WETICE Conference. This track involves works from collaborative web knowledge research community and related themes. Web2Touch 2016 explores the state-of-the-art on users’ practical experiences, as well as trends and research topics paving the way for future collaborative approaches to knowledge management. Papers come from areas such as computational analysis, management of contextual information, support to personalized information management, collaborative knowledge production, consistency, knowledge engineering and security modeling for multiple knowledge sources. The overall focus is on determining how to route, organize, and present contextual and meaningful information and services to facilitate collaboration.}, keywords = {ambit project}, pubstate = {published}, tppubtype = {inproceedings} } The large availability of services, provided by different means such as the Web, smartphone apps, and wearable devices, provides users a valuable support for their everyday activities, but at the same time introduces the need for a tailored choice and exploitation of them. Several approaches have been proposed that take into account users’ preferences, but a comprehensive user-aware approach is still missing.
In this paper we propose a middleware for composing and exploiting services that exhibits some key features: (i) it considers the profile of users that exploit the service to choose appropriate services, (ii) it exploits semantic similarity techniques to make the choice more effective, and (iii) it enables the collaboration among users. By means of a case study we present a possible scenario that can take advantage of our middleware, and show how it can be exploited. |
118. | D. Beneventano, S. Bergamaschi, R. Martoglia (2016): Exploiting Semantics for Searching Agricultural Bibliographic Data. Journal of Information Science, 42 (6), pp. 748-762, 2016, ISSN: 0165-5515. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search) @article{jis2016, title = {Exploiting Semantics for Searching Agricultural Bibliographic Data}, author = {D. Beneventano and S. Bergamaschi and R. Martoglia}, issn = {0165-5515}, year = {2016}, date = {2016-11-30}, journal = {Journal of Information Science}, volume = {42}, number = {6}, pages = {748-762}, abstract = {Filtering and search mechanisms which permit to identify key bibliographic references are fundamental for researchers. In this paper we propose a fully automatic and semantic method for filtering/searching bibliographic data, which allows users to look for information by specifying simple keyword queries or document queries, i.e. by simply submitting existing documents to the system. The limitations of standard techniques, based on either syntactical text search and on manually assigned descriptors, are overcome by considering the semantics intrinsically associated to the document/query terms; to this aim, we exploit different kinds of external knowledge sources (both general and specific domain dictionaries or thesauri). The proposed techniques have been developed and successfully tested for agricultural bibliographic data, which plays a central role to enable researchers and policy makers to retrieve related agricultural and scientific information by using the AGROVOC thesaurus.}, keywords = {Approximate search}, pubstate = {published}, tppubtype = {article} } Filtering and search mechanisms which permit to identify key bibliographic references are fundamental for researchers. In this paper we propose a fully automatic and semantic method for filtering/searching bibliographic data, which allows users to look for information by specifying simple keyword queries or document queries, i.e. by simply submitting existing documents to the system. The limitations of standard techniques, based on either syntactical text search and on manually assigned descriptors, are overcome by considering the semantics intrinsically associated to the document/query terms; to this aim, we exploit different kinds of external knowledge sources (both general and specific domain dictionaries or thesauri). The proposed techniques have been developed and successfully tested for agricultural bibliographic data, which plays a central role to enable researchers and policy makers to retrieve related agricultural and scientific information by using the AGROVOC thesaurus. |
117. | L. Carafoli, F. Mandreoli, R. Martoglia, W. Penzo (2016):
A Data Management Middleware for ITS Services in Smart Cities
Journal of Universal Computer Science, 22 (2), pp. 228-246, 2016, ISSN: 0948-695x. (Type: Journal Article | BibTeX | Tags: Data Stream, ITS Data Management) @article{JUCS16, title = {A Data Management Middleware for ITS Services in Smart Cities}, author = {L. Carafoli and F. Mandreoli and R. Martoglia and W. Penzo}, issn = {0948-695x}, year = {2016}, date = {2016-05-23}, journal = {Journal of Universal Computer Science}, volume = {22}, number = {2}, pages = {228-246}, keywords = {Data Stream, ITS Data Management}, pubstate = {published}, tppubtype = {article} } |
116. | G. Cabri, R. Martoglia, F. Zambonelli (2016): Designing a Collaborative Middleware for Semantic and User-aware Service Composition. Proceedings of the 25th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2016), 2016. (Type: Inproceeding | Abstract | BibTeX | Tags: ambit project) @article{WETICE16, title = {Designing a Collaborative Middleware for Semantic and User-aware Service Composition}, author = {G. Cabri, R. Martoglia, F. Zambonelli}, year = {2016}, date = {2016-04-01}, booktitle = {Proceedings of the 25th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (IEEE WETICE 2016)}, publisher = {IEEE}, abstract = {The large availability of services, provided by different means such as the Web, smartphone apps, and wearable devices, provides users a valuable support for their everyday activities, but at the same time introduces the need for a tailored choice and exploitation of them. Several approaches have been proposed that take into account users’ preferences, but a comprehensive user-aware approach is still missing. In this paper we propose a middleware for composing and exploiting services that exhibits some key features: (i) it considers the profile of users that exploit the service to choose appropriate services, (ii) it exploits semantic similarity techniques to make the choice more effective, and (iii) it enables the collaboration among users. By means of a case study we present a possible scenario that can take advantage of our middleware, and show how it can be exploited.}, keywords = {ambit project}, pubstate = {published}, tppubtype = {inproceedings} } The large availability of services, provided by different means such as the Web, smartphone apps, and wearable devices, provides users a valuable support for their everyday activities, but at the same time introduces the need for a tailored choice and exploitation of them. Several approaches have been proposed that take into account users’ preferences, but a comprehensive user-aware approach is still missing.
In this paper we propose a middleware for composing and exploiting services that exhibits some key features: (i) it considers the profile of users that exploit the service to choose appropriate services, (ii) it exploits semantic similarity techniques to make the choice more effective, and (iii) it enables the collaboration among users. By means of a case study we present a possible scenario that can take advantage of our middleware, and show how it can be exploited. |
115. | G. Cabri, S. Gaddi, R. Martoglia (2016): AMBIT-SE: Towards a User-aware Semantic Enterprise Search Engine. Proceedings of the 12th International Conference on Web Information Systems and Technologies (WEBIST 2016), 2016. (Type: Inproceeding | Abstract | BibTeX | Tags: Approximate search) @article{WEBIST16, title = {AMBIT-SE: Towards a User-aware Semantic Enterprise Search Engine}, author = {G. Cabri, S. Gaddi, R. Martoglia}, year = {2016}, date = {2016-01-19}, booktitle = {Proceedings of the 12th International Conference on Web Information Systems and Technologies (WEBIST 2016)}, publisher = {Springer}, abstract = {Search engines represent one of the most exploited tools both in our everyday life and in our work. In this paper we propose a user-aware semantic enterprise search engine called AMBIT-SE. It is "enterprise" in the sense that it is focused on the search in enterprise websites; the "semantic" aspect is related to the fact that it exploits not an exact word match, but relies also on the meaning of the words by means of synonyms and related terms; finally, to produce query results it takes into account also the user information, which turns out to be very useful to improve the search. We explain how our system works and report the results of experiments on different websites.}, keywords = {ambit project}, pubstate = {published}, tppubtype = {inproceedings} } Search engines represent one of the most exploited tools both in our everyday life and in our work. In this paper we propose a user-aware semantic enterprise search engine called AMBIT-SE. It is "enterprise" in the sense that it is focused on the search in enterprise websites; the "semantic" aspect is related to the fact that it exploits not an exact word match, but relies also on the meaning of the words by means of synonyms and related terms; finally, to produce query results it takes into account also the user information, which turns out to be very useful to improve the search. We explain how our system works and report the results of experiments on different websites. |
114. | F. Mandreoli, R. Martoglia, W. Penzo (2016):
Journal of Computer and System Sciences Special Issue on Query Answering on Graph-Structured Data
Journal of Computer and System Sciences, 82 (1), pp. 1-2, 2016, ISSN: 0022-0000. (Type: Journal Article | BibTeX | Tags: Approximate search) @article{JCSS16, title = {Journal of Computer and System Sciences Special Issue on Query Answering on Graph-Structured Data}, author = {F. Mandreoli and R. Martoglia and W. Penzo}, issn = {0022-0000}, year = {2016}, date = {2016-01-21}, journal = {Journal of Computer and System Sciences}, volume = {82}, number = {1}, pages = {1-2}, keywords = {Approximate search}, pubstate = {published}, tppubtype = {article} } |
113. | G. Cabri, M. Leoncini, R. Martoglia, F. Zambonelli (2016): Towards User-aware Service Composition. Proceedings of the 2nd EAI International Conference on Nature of Computation and Communication, Springer, 2016. (Type: Inproceeding | Abstract | BibTeX | Tags: ambit project) @inproceedings{ictcc16, title = {Towards User-aware Service Composition}, author = {G. Cabri, M. Leoncini, R. Martoglia, F. Zambonelli}, year = {2016}, date = {2016-01-19}, booktitle = {Proceedings of the 2nd EAI International Conference on Nature of Computation and Communication}, publisher = {Springer}, abstract = {Our everyday life is more and more supported by the information technology in general and specific services provided by means of our electronic devices. The AMBIT project (Algorithms and Models for Building context-dependent Information delivery Tools) aims at providing a support to develop services that are automatically tailored based on the user profile. However, while the adaptation of the single services is the first step, the next step is to achieve adaptation in the composition of different services. In this paper, we explore how services can be composed in a user-aware way, in order to decide the composition that better meets users’ requirements. That is, we exploit the user profile not only to provide her customized services, but also to compose them in a suitable way.}, keywords = {ambit project}, pubstate = {published}, tppubtype = {inproceedings} } Our everyday life is more and more supported by the information technology in general and specific services provided by means of our electronic devices. The AMBIT project (Algorithms and Models for Building context-dependent Information delivery Tools) aims at providing a support to develop services that are automatically tailored based on the user profile. However, while the adaptation of the single services is the first step, the next step is to achieve adaptation in the composition of different services. In this paper, we explore how services can be composed in a user-aware way, in order to decide the composition that better meets users’ requirements. That is, we exploit the user profile not only to provide her customized services, but also to compose them in a suitable way. |
2015 | |
112. | R. Haider, F. Mandreoli, R. Martoglia (2015): Effective Aggregation and Querying of Probabilistic RFID Data in a Location Tracking Context. WSEAS Transactions on Information Science and Applications, 12 , pp. 148-160, 2015, ISSN: 1790-0832. (Type: Journal Article | Abstract | BibTeX | Tags: Outdoor Video Protection Project, Sensors and RFIDs) @article{wseas15, title = {Effective Aggregation and Querying of Probabilistic RFID Data in a Location Tracking Context}, author = {R. Haider and F. Mandreoli and R. Martoglia}, issn = {1790-0832}, year = {2015}, date = {2015-09-28}, journal = {WSEAS Transactions on Information Science and Applications}, volume = {12}, pages = {148-160}, abstract = {RFID applications usually rely on RFID deployments to manage high-level events such as tracking the location that products visit for supply-chain management, localizing intruders for alerting services, and so on. However, transforming low-level streams into high-level events poses a number of challenges. In this paper, we deal with the well known issues of data redundancy and data-information mismatch: we propose an on-line summarization mechanism that is able to provide small space representation for massive RFID probabilistic data streams while preserving the meaningfulness of the information. We also show that common information needs, i.e. detecting complex events meaningful to applications, can be effectively answered by executing temporal probabilistic SQL queries directly on the summarized data. All the techniques presented in this paper are implemented in a complete framework and successfully evaluated in real-world location tracking scenarios.}, keywords = {Outdoor Video Protection Project, Sensors and RFIDs}, pubstate = {published}, tppubtype = {article} } RFID applications usually rely on RFID deployments to manage high-level events such as tracking the location that products visit for supply-chain management, localizing intruders for alerting services, and so on. However, transforming low-level streams into high-level events poses a number of challenges. In this paper, we deal with the well known issues of data redundancy and data-information mismatch: we propose an on-line summarization mechanism that is able to provide small space representation for massive RFID probabilistic data streams while preserving the meaningfulness of the information. We also show that common information needs, i.e. detecting complex events meaningful to applications, can be effectively answered by executing temporal probabilistic SQL queries directly on the summarized data. All the techniques presented in this paper are implemented in a complete framework and successfully evaluated in real-world location tracking scenarios. |
111. | S. Bergamaschi, R. Martoglia, S. Sorrentino (2015): Exploiting semantics for filtering and searching knowledge in a software development context Knowledge and Information Systems, 45 (2), pp. 295-318, 2015, ISSN: 02191377. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search, FACIT Project, SE Knowledge Management, Structural Disambiguation) @article{KAIS14, title = {Exploiting semantics for filtering and searching knowledge in a software development context}, author = {S. Bergamaschi and R. Martoglia and S. Sorrentino}, issn = {02191377}, year = {2015}, date = {2015-09-21}, urldate = {2014-12-09}, journal = {Knowledge and Information Systems}, volume = {45}, number = {2}, pages = {295-318}, publisher = {Elsevier}, abstract = {Software development is still considered a bottleneck for Small and Medium Enterprises (SMEs) in the advance of the Information Society. Usually, SMEs store and collect a large number of software textual documentation; these documents might be profitably used to facilitate them in using (and re-using) Software Engineering methods for systematically designing their applications, thus reducing software development cost. Specific and semantics textual filtering/search mechanisms, supporting the identification of adequate processes and practices for the enterprise needs, are fundamental in this context. To this aim, we present an automatic document retrieval method based on semantic similarity and Word Sense Disambiguation techniques. The proposal leverages on the strengths of both classic information retrieval and knowledge-based techniques, exploiting syntactical and semantic information provided by general and specific domain knowledge sources. For any SME, it is as easily and generally applicable as are the search techniques offered by common enterprise Content Management Systems. Our method was developed within the FACIT-SME European FP-7 project, whose aim is to facilitate the diffusion of Software Engineering methods and best practices among SMEs. As shown by a detailed experimental evaluation, the achieved effectiveness goes well beyond typical retrieval solutions.}, keywords = {Approximate search, FACIT Project, SE Knowledge Management, Structural Disambiguation}, pubstate = {published}, tppubtype = {article} } Software development is still considered a bottleneck for Small and Medium Enterprises (SMEs) in the advance of the Information Society. Usually, SMEs store and collect a large number of software textual documentation; these documents might be profitably used to facilitate them in using (and re-using) Software Engineering methods for systematically designing their applications, thus reducing software development cost. Specific and semantics textual filtering/search mechanisms, supporting the identification of adequate processes and practices for the enterprise needs, are fundamental in this context. To this aim, we present an automatic document retrieval method based on semantic similarity and Word Sense Disambiguation techniques. The proposal leverages on the strengths of both classic information retrieval and knowledge-based techniques, exploiting syntactical and semantic information provided by general and specific domain knowledge sources. For any SME, it is as easily and generally applicable as are the search techniques offered by common enterprise Content Management Systems. Our method was developed within the FACIT-SME European FP-7 project, whose aim is to facilitate the diffusion of Software Engineering methods and best practices among SMEs. As shown by a detailed experimental evaluation, the achieved effectiveness goes well beyond typical retrieval solutions. |
110. | F. Mandreoli, R. Martoglia, W. Penzo (2015): Approximating expressive queries on graph-modeled data: The GeX approach
Journal of Systems and Software, 2015 (109), pp. 106-123, 2015, ISSN: 0164-1212. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search) @article{jss15a, title = {Approximating expressive queries on graph-modeled data: The GeX approach}, author = {F. Mandreoli and R. Martoglia and W. Penzo}, issn = {0164-1212}, year = {2015}, date = {2015-08-31}, journal = {Journal of Systems and Software}, volume = {2015}, number = {109}, pages = {106-123}, abstract = {We present the GeX (Graph-eXplorer) approach for the approximate matching of complex queries on graph-modeled data. GeX generalizes existing approaches and provides for a highly expressive graph-based query language that supports queries ranging from keyword-based to structured ones. The GeX query answering model gracefully blends label approximation with structural relaxation, under the primary objective of delivering meaningfully approximated results only. GeX implements ad-hoc data structures that are exploited by a top-k retrieval algorithm which enhances the approximate matching of complex queries. An extensive experimental evaluation on real world datasets demonstrates the efficiency of the GeX query answering.}, keywords = {Approximate search}, pubstate = {published}, tppubtype = {article} } We present the GeX (Graph-eXplorer) approach for the approximate matching of complex queries on graph-modeled data. GeX generalizes existing approaches and provides for a highly expressive graph-based query language that supports queries ranging from keyword-based to structured ones. The GeX query answering model gracefully blends label approximation with structural relaxation, under the primary objective of delivering meaningfully approximated results only. GeX implements ad-hoc data structures that are exploited by a top-k retrieval algorithm which enhances the approximate matching of complex queries. An extensive experimental evaluation on real world datasets demonstrates the efficiency of the GeX query answering. |
109. | R. Martoglia (2015): AMBIT: Semantic Engine Foundations for Knowledge Management in Context-dependent Applications
Proceedings of the 27th International Conference on Software Engineering and Knowledge Engineering (SEKE 2015), pp. 146-151, KSI Research Inc. and Knowledge Systems Institute Graduate School 2015, 2015, ISBN: 1-891706-35-7. (Type: Inproceeding | Abstract | BibTeX | Tags: ambit project, Approximate search) @inproceedings{seke2015, title = {AMBIT: Semantic Engine Foundations for Knowledge Management in Context-dependent Applications}, author = {R. Martoglia}, isbn = {1-891706-35-7}, year = {2015}, date = {2015-04-22}, urldate = {2015-04-22}, booktitle = {Proceedings of the 27th International Conference on Software Engineering and Knowledge Engineering (SEKE 2015)}, pages = {146-151}, publisher = {KSI Research Inc. and Knowledge Systems Institute Graduate School 2015}, abstract = {Context-aware application and services proposing potentially useful information to users are more and more widespread; however, their actual usefulness is often limited by the \"syntactical\" notion of context they adopt. The recently started AMBIT project aims to provide a general software architecture for developing semantic-based context-aware tools in a number of vertical case study applications. In this paper, we focus on the knowledge management foundations we are laying for the Semantic Engine of the AMBIT architecture. The proposed semantic analysis and similarity techniques: (a) exploit the textual information deeply characterizing both users and the information to be retrieved; (b) overcome the limits of syntactic methods by leveraging on the strengths of both classic information retrieval and knowledge-based analysis and classification, ultimately proposing information relevant to the user interests. The experimental evaluation of a preliminary implementation in an actual \"cultural territorial enhancement\" scenario already shows promising results.}, keywords = {ambit project, Approximate search}, pubstate = {published}, tppubtype = {inproceedings} } Context-aware application and services proposing potentially useful information to users are more and more widespread; however, their actual usefulness is often limited by the "syntactical" notion of context they adopt. The recently started AMBIT project aims to provide a general software architecture for developing semantic-based context-aware tools in a number of vertical case study applications. In this paper, we focus on the knowledge management foundations we are laying for the Semantic Engine of the AMBIT architecture. The proposed semantic analysis and similarity techniques: (a) exploit the textual information deeply characterizing both users and the information to be retrieved; (b) overcome the limits of syntactic methods by leveraging on the strengths of both classic information retrieval and knowledge-based analysis and classification, ultimately proposing information relevant to the user interests. The experimental evaluation of a preliminary implementation in an actual "cultural territorial enhancement" scenario already shows promising results. |
2014 | |
108. | V. Lomonaco, R. Martoglia, F. Mandreoli, L. Anderlucci, W. Emmett, S. Bicciato, C. Taccioli (2014):
UCbase 2.0: ultraconserved sequences database (2014 update)
Database: the journal of biological databases and curation, 2014 , 2014, ISSN: 17580463. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search, Biological data) @article{ucbase14, title = {UCbase 2.0: ultraconserved sequences database (2014 update)}, author = {V. Lomonaco and R. Martoglia and F. Mandreoli and L. Anderlucci and W. Emmett and S. Bicciato and C. Taccioli}, issn = {17580463}, year = {2014}, date = {2014-01-01}, urldate = {2014-09-10}, journal = {Database: the journal of biological databases and curation}, volume = {2014}, abstract = {UCbase 2.0 is an update, extension and evolution of UCbase, a Web tool dedicated to the analysis of ultraconserved sequences (UCRs). UCRs are 481 sequences >200 bases sharing 100% identity among human, mouse and rat genomes. They are frequently located in genomic regions known to be involved in cancer or differentially expressed in human leukemias and carcinomas. UCbase 2.0 is a platform-independent Web resource that includes the updated version of the human genome annotation (hg19), information linking disorders to chromosomal coordinates based on the Systematized Nomenclature of Medicine classification, a query tool to search for Single Nucleotide Polymorphisms (SNPs) and a new text box to directly interrogate the database using a MySQL interface. To facilitate the interactive visual interpretation of UCR chromosomal positioning, UCbase 2.0 now includes a graph visualization interface directly linked to UCSC genome browser. }, keywords = {Approximate search, Biological data}, pubstate = {published}, tppubtype = {article} } UCbase 2.0 is an update, extension and evolution of UCbase, a Web tool dedicated to the analysis of ultraconserved sequences (UCRs). UCRs are 481 sequences >200 bases sharing 100% identity among human, mouse and rat genomes. They are frequently located in genomic regions known to be involved in cancer or differentially expressed in human leukemias and carcinomas. UCbase 2.0 is a platform-independent Web resource that includes the updated version of the human genome annotation (hg19), information linking disorders to chromosomal coordinates based on the Systematized Nomenclature of Medicine classification, a query tool to search for Single Nucleotide Polymorphisms (SNPs) and a new text box to directly interrogate the database using a MySQL interface. To facilitate the interactive visual interpretation of UCR chromosomal positioning, UCbase 2.0 now includes a graph visualization interface directly linked to UCSC genome browser. |
107. | G. Cabri, M. Leoncini, R. Martoglia (2014): AMBIT: Towards an Architecture for the Development of Context-dependent Applications and Systems. Proceedings of the 3rd International Conference on Context-Aware Systems and Applications (ICCASA 2014), Dubai, United Arab Emirates, 2014. (Type: Conference | Abstract | Links | BibTeX | Tags: ambit project, Approximate search) @conference{iccasa14, title = {AMBIT: Towards an Architecture for the Development of Context-dependent Applications and Systems}, author = {G. Cabri and M. Leoncini and R. Martoglia}, url = {http://www.isgroup.unimore.it/article/iccasa14.pdf}, year = {2014}, date = {2014-10-15}, urldate = {2014-09-10}, booktitle = {Proceedings of the 3rd International Conference on Context-Aware Systems and Applications (ICCASA 2014)}, address = {Dubai, United Arab Emirates}, abstract = {The development of ubiquitous services tailored to the needs and expectations of a very large number of potential users (especially mobile users) requires that future applications and systems be aware of the service fruition contexts and possibly of accurate user profiles. The AMBIT research project aims at providing a general model of context as well as a platform that can be exploited to build and deploy different kinds of context-dependent applications and systems. We aim at overcoming the restrictions of the existing approaches, which are mainly due to the limited notion of context they propose (if any). In particular, we stress the fact that current technologies does not accurately consider the notion of context semantics and user profile, which is the main source of the flooding of useless data that overload systems and often users’ minds.}, keywords = {ambit project, Approximate search}, pubstate = {published}, tppubtype = {conference} } The development of ubiquitous services tailored to the needs and expectations of a very large number of potential users (especially mobile users) requires that future applications and systems be aware of the service fruition contexts and possibly of accurate user profiles. The AMBIT research project aims at providing a general model of context as well as a platform that can be exploited to build and deploy different kinds of context-dependent applications and systems. We aim at overcoming the restrictions of the existing approaches, which are mainly due to the limited notion of context they propose (if any). In particular, we stress the fact that current technologies does not accurately consider the notion of context semantics and user profile, which is the main source of the flooding of useless data that overload systems and often users’ minds. |
106. | L. Carafoli, F. Mandreoli, R. Martoglia (2014): Advanced Data Management for real-time data intensive applications and services. Journal of Ambient Intelligence and Smart Environments, 6 (6), pp. 741-742, 2014, ISSN: 1876-1364. (Type: Journal Article | BibTeX | Tags: Data Stream, ITS Data Management, Pegasus project) @article{JAISEPhd1, title = {Advanced Data Management for real-time data intensive applications and services}, author = {L. Carafoli and F. Mandreoli and R. Martoglia}, issn = {1876-1364}, year = {2014}, date = {2014-12-09}, urldate = {2014-12-09}, journal = {Journal of Ambient Intelligence and Smart Environments}, volume = {6}, number = {6}, pages = {741-742}, publisher = {IOS Press}, keywords = {Data Stream, ITS Data Management, Pegasus project}, pubstate = {published}, tppubtype = {article} } |
105. | R. Haider, F. Mandreoli, R. Martoglia (2014): Data management techniques for active RFID applications. Journal of Ambient Intelligence and Smart Environments, 6 (6), pp. 743-744, 2014, ISSN: 1876-1364. (Type: Journal Article | BibTeX | Tags: Outdoor Video Protection Project, Sensors and RFIDs) @article{JAISEPhd2, title = {Data management techniques for active RFID applications}, author = {R. Haider and F. Mandreoli and R. Martoglia}, issn = {1876-1364}, year = {2014}, date = {2014-12-09}, urldate = {2014-12-09}, journal = {Journal of Ambient Intelligence and Smart Environments}, volume = {6}, number = {6}, pages = {743-744}, publisher = {IOS Press}, keywords = {Outdoor Video Protection Project, Sensors and RFIDs}, pubstate = {published}, tppubtype = {article} } |
104. | R. Haider, F. Mandreoli, R. Martoglia (2014): Online filtering and uncertainty management techniques for rfid data processing. WSEAS Transactions on Information Science and Applications, 11 , pp. 231-241, 2014, ISSN: 17900832. (Type: Journal Article | Abstract | BibTeX | Tags: Outdoor Video Protection Project, Sensors and RFIDs) @article{WSEAS14, title = {Online filtering and uncertainty management techniques for rfid data processing}, author = {R. Haider and F. Mandreoli and R. Martoglia}, issn = {17900832}, year = {2014}, date = {2014-12-09}, urldate = {2014-12-09}, journal = {WSEAS Transactions on Information Science and Applications}, volume = {11}, pages = {231-241}, publisher = {World Scientific and Engineering Academy and Society}, abstract = {RFID is one of the emerging technologies for a wide-range of applications, including supply chain and asset management, healthcare and intruder localization. However, the nature of an RFID data stream is noisy, redundant and unreliable, making it unsuitable for direct use in applications. In this paper, we propose specific RFID Online Filtering and Uncertainty Management techniques that operate on unreliable and imprecise data streams in order to transform them into reliable probabilistic data that can be meaningful to the applications. Our proposal makes use of an Hidden Markov Model (HMM) that continuously infers hidden variables (locations, in case of above example) based on sensor readings. The resulting data can be directly stored in a probabilistic database table for further analysis. All the techniques presented in this paper are implemented in a complete framework and succesfully evaluated in real-world object tracking scenarios.}, keywords = {Outdoor Video Protection Project, Sensors and RFIDs}, pubstate = {published}, tppubtype = {article} } RFID is one of the emerging technologies for a wide-range of applications, including supply chain and asset management, healthcare and intruder localization. However, the nature of an RFID data stream is noisy, redundant and unreliable, making it unsuitable for direct use in applications. In this paper, we propose specific RFID Online Filtering and Uncertainty Management techniques that operate on unreliable and imprecise data streams in order to transform them into reliable probabilistic data that can be meaningful to the applications. Our proposal makes use of an Hidden Markov Model (HMM) that continuously infers hidden variables (locations, in case of above example) based on sensor readings. The resulting data can be directly stored in a probabilistic database table for further analysis. All the techniques presented in this paper are implemented in a complete framework and succesfully evaluated in real-world object tracking scenarios. |
103. | R. Haider, F. Mandreoli, R. Martoglia (2014): RPDM: A System for RFID Probabilistic Data Management
Journal of Ambient Intelligence and Smart Environments, 6 (6), pp. 707-722, 2014, ISSN: 1876-1364. (Type: Journal Article | Abstract | BibTeX | Tags: Outdoor Video Protection Project, Sensors and RFIDs) @article{JaiseRFID, title = {RPDM: A System for RFID Probabilistic Data Management}, author = {R. Haider and F. Mandreoli and R. Martoglia}, issn = {1876-1364}, year = {2014}, date = {2014-10-01}, urldate = {2014-10-01}, journal = {Journal of Ambient Intelligence and Smart Environments}, volume = {6}, number = {6}, pages = {707-722}, publisher = {IOS Press}, abstract = {Data streams are more and more commonly generated in a large number of scenarios by audio and video devices, Global Positioning System (GPS), Radio Frequency Identification (RFID) and other types of sensors. In particular, RFID technology has recently gained significant popularity, especially for real-time people and goods tracking, however the noisy, redundant and unreliable nature of RFID streams, coupled with their huge size, can make their exploitation and management difficult. In this paper, we present a realtime system for RFID Probabilistic Data Management (RPDM). The system manages unreliable and noisy raw RFID data and transforms them into reliable meaningful probabilistic data streams by means of a newly proposed method based on a probabilistic Hidden Markov Model (HMM). Moreover, to handle the huge data volume generated by RFID deployments, RPDM proposes and implements a simple on-line summarization mechanism, which is able to provide small space representation for the massive RFID probabilistic data streams while preserving the meaningful information. The results are promptly stored in a probabilistic database, in such a way that a wide range of probabilistic queries can be submitted and answered effectively. The experimental evaluation proves the feasibility of the approach in real-world object tracking scenarios.}, keywords = {Outdoor Video Protection Project, Sensors and RFIDs}, pubstate = {published}, tppubtype = {article} } Data streams are more and more commonly generated in a large number of scenarios by audio and video devices, Global Positioning System (GPS), Radio Frequency Identification (RFID) and other types of sensors. In particular, RFID technology has recently gained significant popularity, especially for real-time people and goods tracking, however the noisy, redundant and unreliable nature of RFID streams, coupled with their huge size, can make their exploitation and management difficult. In this paper, we present a realtime system for RFID Probabilistic Data Management (RPDM). The system manages unreliable and noisy raw RFID data and transforms them into reliable meaningful probabilistic data streams by means of a newly proposed method based on a probabilistic Hidden Markov Model (HMM). Moreover, to handle the huge data volume generated by RFID deployments, RPDM proposes and implements a simple on-line summarization mechanism, which is able to provide small space representation for the massive RFID probabilistic data streams while preserving the meaningful information. The results are promptly stored in a probabilistic database, in such a way that a wide range of probabilistic queries can be submitted and answered effectively. The experimental evaluation proves the feasibility of the approach in real-world object tracking scenarios. |
2013 | |
102. | L. Carafoli, F. Mandreoli, R. Martoglia, W. Penzo (2013):
A Framework for ITS Data Management in a Smart City Scenario
Proceedings of the 2nd International Conference on Smart Grids and Green IT Syste, May 2013 (SmartGreens 2013), Aachen, Germany, 2013. (Type: Inproceeding | Abstract | BibTeX | Tags: Data Stream, ITS Data Management, Pegasus project) @inproceedings{pub102, title = {A Framework for ITS Data Management in a Smart City Scenario}, author = {L. Carafoli and F. Mandreoli and R. Martoglia and W. Penzo}, year = {2013}, date = {2013-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 2nd International Conference on Smart Grids and Green IT Syste, May 2013 (SmartGreens 2013)}, address = {Aachen, Germany}, abstract = {In this paper we introduce a technological framework to efficiently support data management in a modern Intelligent Transportation System (ITS). The proposed technology enables the efficient storage of a variety of recent/historical/static data and guarantees its effective querying by supporting continuous as well as one-time queries for the delivering of real-time traffic services. The framework also offers a scalable solution for coping with the acquisition of huge volumes of data by employing data reduction techniques in Vehicle-to-Infrastructure transmissions. Experimental evaluation on the Linear Road ITS benchmark and along various simulated scenarios demonstrates that the proposed framework efficiently supports smart city data needs.}, keywords = {Data Stream, ITS Data Management, Pegasus project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we introduce a technological framework to efficiently support data management in a modern Intelligent Transportation System (ITS). The proposed technology enables the efficient storage of a variety of recent/historical/static data and guarantees its effective querying by supporting continuous as well as one-time queries for the delivering of real-time traffic services. The framework also offers a scalable solution for coping with the acquisition of huge volumes of data by employing data reduction techniques in Vehicle-to-Infrastructure transmissions. Experimental evaluation on the Linear Road ITS benchmark and along various simulated scenarios demonstrates that the proposed framework efficiently supports smart city data needs. |
101. | C. Grana, G. Serra, M. Manfredi, R. Cucchiara, R. Martoglia, F. Mandreoli (2013): UNIMORE at ImageCLEF 2013: Scalable Concept Image Annotation. Proceedings of the Image Retrieval in Conference and Labs of the Evaluation Forum, September 2013 (ImageClef 2013), Valencia, Spain, 2013. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search) @inproceedings{imageclef13, title = {UNIMORE at ImageCLEF 2013: Scalable Concept Image Annotation}, author = {C. Grana and G. Serra and M. Manfredi and R. Cucchiara and R. Martoglia and F. Mandreoli}, url = {http://www.isgroup.unimore.it/article/clef2013.pdf}, year = {2013}, date = {2013-09-26}, urldate = {2013-07-31}, booktitle = {Proceedings of the Image Retrieval in Conference and Labs of the Evaluation Forum, September 2013 (ImageClef 2013)}, address = {Valencia, Spain}, abstract = {In this paper we propose a large-scale Image annotation system for the Scalable Concept Image Annotation task. For each concept to be detected a separated classifier is built using the provided textual annotation. Images are represented as a Multivariate Gaussian distribution of a set of local features extracted over a dense regular grid. Textual analysis, on the web pages containing training images, is performed to retrieve a relevant set of samples for learning each concept classifier. An online SVMs solver based on Stochastic Gradient Descent is used to manage the large amount of training data. Experimental results show that the combination of different kind of local features encoded with our strategy achieves very competitive performance both in terms of mAP and mean F-measure.}, keywords = {Approximate search}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we propose a large-scale Image annotation system for the Scalable Concept Image Annotation task. For each concept to be detected a separated classifier is built using the provided textual annotation. Images are represented as a Multivariate Gaussian distribution of a set of local features extracted over a dense regular grid. Textual analysis, on the web pages containing training images, is performed to retrieve a relevant set of samples for learning each concept classifier. An online SVMs solver based on Stochastic Gradient Descent is used to manage the large amount of training data. Experimental results show that the combination of different kind of local features encoded with our strategy achieves very competitive performance both in terms of mAP and mean F-measure. |
100. | B. Catania, G. Guerrini, A. Belussi, F. Mandreoli, R. Martoglia, W. Penzo (2013): Wearable Queries: Adapting Common Retrieval Needs to Data and Users (Vision Paper). Proceedings of the 7th International Workshop on Ranking in Databases, 30th August 2013 (DBRank 2013), Riva del Garda, Trento, 2013. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing) @inproceedings{dbrank13wq, title = {Wearable Queries: Adapting Common Retrieval Needs to Data and Users (Vision Paper)}, author = {B. Catania and G. Guerrini and A. Belussi and F. Mandreoli and R. Martoglia and W. Penzo}, url = {http://www.isgroup.unimore.it/article/wq-cameraready.pdf}, year = {2013}, date = {2013-08-30}, urldate = {2013-07-31}, booktitle = {Proceedings of the 7th International Workshop on Ranking in Databases, 30th August 2013 (DBRank 2013)}, address = {Riva del Garda, Trento}, abstract = {The wealth of information generated by users interacting with the network and its applications is often under-utilized due to complications in accessing heterogeneous and dynamic data and retrieving relevant information from sources having possibly unknown formats and structures. Processing complex requests on such information sources can, thus, be costly, though not guaranteeing user satisfaction. Furthermore, dynamic contexts prevent substantial user involvement in the interpretation of the request. The paper envisions an innovative solution to process the above mentioned requests, limiting user involvement by exploiting information on: (a) user context (geo-location, interests, needs); (b) data and processing quality; (c) similar requests repeated over time. By interpreting a request in a novel way by means of a Wearable Query (WQ), i.e., a query that captures the user and request specificities, we envision a methodological and technological solution for WQs in the presence of repeated information needs in distributed, heterogeneous, dynamic environments, with emphasis on the geo-spatial dimension and on data quality.}, keywords = {Approximate search, Data Sharing}, pubstate = {published}, tppubtype = {inproceedings} } The wealth of information generated by users interacting with the network and its applications is often under-utilized due to complications in accessing heterogeneous and dynamic data and retrieving relevant information from sources having possibly unknown formats and structures. Processing complex requests on such information sources can, thus, be costly, though not guaranteeing user satisfaction. Furthermore, dynamic contexts prevent substantial user involvement in the interpretation of the request. The paper envisions an innovative solution to process the above mentioned requests, limiting user involvement by exploiting information on: (a) user context (geo-location, interests, needs); (b) data and processing quality; (c) similar requests repeated over time. By interpreting a request in a novel way by means of a Wearable Query (WQ), i.e., a query that captures the user and request specificities, we envision a methodological and technological solution for WQs in the presence of repeated information needs in distributed, heterogeneous, dynamic environments, with emphasis on the geo-spatial dimension and on data quality. |
2012 | |
99. | F. Mandreoli, W. Penzo, S. Rizzi, M. Golfarelli, E. Turricchia (2012): OLAP Query Reformulation in Peer-to-Peer Data Warehousing. Information Systems, 37 (5), pp. 393-411, 2012. (Type: Journal Article | BibTeX | Tags: BIN, Data Sharing) @article{pub93, title = {OLAP Query Reformulation in Peer-to-Peer Data Warehousing}, author = {F. Mandreoli and W. Penzo and S. Rizzi and M. Golfarelli and E. Turricchia}, year = {2012}, date = {2012-01-01}, urldate = {2013-06-12}, journal = {Information Systems}, volume = {37}, number = {5}, pages = {393-411}, keywords = {BIN, Data Sharing}, pubstate = {published}, tppubtype = {article} } |
98. | R. Haider, F. Mandreoli, R. Martoglia, S. Sassatelli (2012): Fast On-Line Summarization of RFID Probabilistic Data Streams. Sixth International Conference on Information Systems, Technology & Manageme, March 2012 (ICISTM 2012), pp. 211-223, Grenoble, France, 2012. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Stream, Outdoor Video Protection Project, Radio Sensors Project) @inproceedings{pub95, title = {Fast On-Line Summarization of RFID Probabilistic Data Streams}, author = {R. Haider and F. Mandreoli and R. Martoglia and S. Sassatelli}, url = {http://www.isgroup.unimore.it/article/icistm12.pdf}, year = {2012}, date = {2012-01-01}, urldate = {2013-06-12}, booktitle = {Sixth International Conference on Information Systems, Technology & Manageme, March 2012 (ICISTM 2012)}, pages = {211-223}, address = {Grenoble, France}, abstract = {RFID applications usually rely on RFID deployments to manage high-level events. A fundamental relation for these purposes is the location of people and objects over time. However, the nature of RFID data streams is noisy, redundant and unreliable and thus streams of low-level tag-reads can be transformed into probabilistic data streams that can reach in practical cases the size of gigabytes in a day. In this paper, we propose a simple on-line summarization mechanism, which is able to provide small space representation for massive RFID probabilistic data streams while preserving the meaningful information. The main idea behind the proposed approach is to keep on aggregating tuples in an incremental way until a state transition is detected. Probabilistic tuples are processed as they arrive, hence avoiding the use of expensive offline disk based operations, and the output is stored in a probabilistic database in such a way that, as we also experimentally prove, a wide range of probabilistic queries can be applicable and answered effectively.}, keywords = {Data Stream, Outdoor Video Protection Project, Radio Sensors Project}, pubstate = {published}, tppubtype = {inproceedings} } RFID applications usually rely on RFID deployments to manage high-level events. A fundamental relation for these purposes is the location of people and objects over time. However, the nature of RFID data streams is noisy, redundant and unreliable and thus streams of low-level tag-reads can be transformed into probabilistic data streams that can reach in practical cases the size of gigabytes in a day. In this paper, we propose a simple on-line summarization mechanism, which is able to provide small space representation for massive RFID probabilistic data streams while preserving the meaningful information. The main idea behind the proposed approach is to keep on aggregating tuples in an incremental way until a state transition is detected. Probabilistic tuples are processed as they arrive, hence avoiding the use of expensive offline disk based operations, and the output is stored in a probabilistic database in such a way that, as we also experimentally prove, a wide range of probabilistic queries can be applicable and answered effectively. |
97. | M. Ceci, M. Coluccia, F. Fumarola, P. H. Guzzi, F. Mandreoli, R. Martoglia, E. Masciari, M. Mecella, W. Penzo (2012): A Framework For Biological Data Normalization, Interoperability, and Mining for Cancer Microenvironment Analysis. Proceedings of the 20th Italian Symposium on Advanced Database Syste, June 2012 (SEBD 2012), pp. 67-74, Venezia, Italia, 2012. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Biological data, Data Sharing) @inproceedings{pub96, title = {A Framework For Biological Data Normalization, Interoperability, and Mining for Cancer Microenvironment Analysis}, author = {M. Ceci and M. Coluccia and F. Fumarola and P. H. Guzzi and F. Mandreoli and R. Martoglia and E. Masciari and M. Mecella and W. Penzo}, url = {http://www.isgroup.unimore.it/article/sebd12.pdf}, year = {2012}, date = {2012-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 20th Italian Symposium on Advanced Database Syste, June 2012 (SEBD 2012)}, pages = {67-74}, address = {Venezia, Italia}, abstract = {Over the last decade, the advances in the high-throughput omic technologies have given the possibility to profile tumor cells at different levels, fostering the discovery of new biological data and the proliferation of a large number of bio-technological databases. In this paper we describe a framework for enabling the interoperability among different biological data sources and for ultimately supporting expert users in the complex process of extraction, navigation and visualization of the precious knowledge hidden in a such huge quantity of data. In this framework, a key role is played by the Connectivity Map, a databank which relates diseases, physiological processes, and the action of drugs. The system will be used in a pilot study on the Multiple Myeloma (MM).}, keywords = {Biological data, Data Sharing}, pubstate = {published}, tppubtype = {inproceedings} } Over the last decade, the advances in the high-throughput omic technologies have given the possibility to profile tumor cells at different levels, fostering the discovery of new biological data and the proliferation of a large number of bio-technological databases. In this paper we describe a framework for enabling the interoperability among different biological data sources and for ultimately supporting expert users in the complex process of extraction, navigation and visualization of the precious knowledge hidden in a such huge quantity of data. In this framework, a key role is played by the Connectivity Map, a databank which relates diseases, physiological processes, and the action of drugs. The system will be used in a pilot study on the Multiple Myeloma (MM). |
96. | S. Bergamaschi, R. Martoglia, S. Sorrentino (2012):
A Semantic Method for Searching Knowledge in a Software Development Context
Proceedings of the 20th Italian Symposium on Advanced Database Systems, June 2012 (SEBD 2012), pp. 115-122, Venezia, Italia, 2012. (Type: Inproceeding | Abstract | BibTeX | Tags: Approximate search, FACIT Project, SE Knowledge Management) @inproceedings{pub97, title = {A Semantic Method for Searching Knowledge in a Software Development Context}, author = {S. Bergamaschi and R. Martoglia and S. Sorrentino}, year = {2012}, date = {2012-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 20th Italian Symposium on Advanced Database Systems, June 2012 (SEBD 2012)}, pages = {115-122}, address = {Venezia, Italia}, abstract = {The FACIT-SME European FP-7 project targets to facilitate the use and sharing of Software Engineering (SE) methods and best practices among software developing SMEs. In this context, we present an automatic semantic document searching method based on Word Sense Disambiguation which exploits both syntactic and semantic information provided by external dictionaries and is easily applicable for any SME.}, keywords = {Approximate search, FACIT Project, SE Knowledge Management}, pubstate = {published}, tppubtype = {inproceedings} } The FACIT-SME European FP-7 project targets to facilitate the use and sharing of Software Engineering (SE) methods and best practices among software developing SMEs. In this context, we present an automatic semantic document searching method based on Word Sense Disambiguation which exploits both syntactic and semantic information provided by external dictionaries and is easily applicable for any SME. |
95. | L. Carafoli, F. Mandreoli, R. Martoglia, W. Penzo (2012): Evaluation of Data Reduction Techniques for Vehicle to Infrastructure Communication Saving Purposes. Proceedings of the 16th International Database Engineering & Applications Symposi, August 2012 (IDEAS 2012), pp. 61-70, Prague, Czech Republic, 2012. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Stream, ITS Data Management, Pegasus project) @inproceedings{pub98, title = {Evaluation of Data Reduction Techniques for Vehicle to Infrastructure Communication Saving Purposes}, author = {L. Carafoli and F. Mandreoli and R. Martoglia and W. Penzo}, url = {http://www.isgroup.unimore.it/article/ideas12.pdf}, year = {2012}, date = {2012-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 16th International Database Engineering & Applications Symposi, August 2012 (IDEAS 2012)}, pages = {61-70}, address = {Prague, Czech Republic}, abstract = {In this paper we investigate the employment of different data reduction techniques to minimize V2I communication in an Intelligent Transportation System (ITS). We consider the context of the PEGASUS Project, where vehicles are equipped with sensor-based devices able to compute information like vehicle\'s position and speed that communicate this traffic data to a Control Centre (CC). The CC relies on a general-purpose data management module that supports the execution of continuous queries as well as standard SQL one-time queries on the collected data to provide various infomobility services, spanning from traffic monitoring to location-based ones. The traffic scenarios envisioned in PEGASUS are the more disparate, ranging from highway roads, that characterize typical US traffic schemes, to urban areas, that represent most European transport realities. The paper explores two categories of data reduction techniques: independent techniques, where vehicles autonomously send data to the CC, and information-need techniques, where data is sent by taking into account additional data received from the CC. The former are attracting since they do not require additional communication costs due to CC\'s data transmission. The latter are interesting because they implement sophisticated mechanisms that are much more effective for specific traffic monitoring services. These could be profitably employed under specific CC\'s workloads. The paper discusses and implements the technical changes needed at the CC to support the required infomobility services under the reduced availability of data. All investigated techniques have been extensively evaluated in a variety of traffic scenarios.}, keywords = {Data Stream, ITS Data Management, Pegasus project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we investigate the employment of different data reduction techniques to minimize V2I communication in an Intelligent Transportation System (ITS). We consider the context of the PEGASUS Project, where vehicles are equipped with sensor-based devices able to compute information like vehicle's position and speed that communicate this traffic data to a Control Centre (CC). The CC relies on a general-purpose data management module that supports the execution of continuous queries as well as standard SQL one-time queries on the collected data to provide various infomobility services, spanning from traffic monitoring to location-based ones. The traffic scenarios envisioned in PEGASUS are the more disparate, ranging from highway roads, that characterize typical US traffic schemes, to urban areas, that represent most European transport realities. The paper explores two categories of data reduction techniques: independent techniques, where vehicles autonomously send data to the CC, and information-need techniques, where data is sent by taking into account additional data received from the CC. The former are attracting since they do not require additional communication costs due to CC's data transmission. The latter are interesting because they implement sophisticated mechanisms that are much more effective for specific traffic monitoring services. These could be profitably employed under specific CC's workloads. The paper discusses and implements the technical changes needed at the CC to support the required infomobility services under the reduced availability of data. All investigated techniques have been extensively evaluated in a variety of traffic scenarios. |
94. | M. Ceci, F. Fumarola, P. H. Guzzi, F. Mandreoli, R. Martoglia, E. Masciari, M. Mecella, W. Penzo (2012): Toward a Semantic Framework for the Querying, Mining and Visualization of Cancer Microenvironment Data. Proceedings of the 3rd International Conference on Information Technology in Bio- and Medical Informati, September 2012 (ITBAM 2012), pp. 109-123, Vienna, Austria, 2012. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Biological data, Data Sharing) @inproceedings{pub99, title = {Toward a Semantic Framework for the Querying, Mining and Visualization of Cancer Microenvironment Data}, author = {M. Ceci and F. Fumarola and P. H. Guzzi and F. Mandreoli and R. Martoglia and E. Masciari and M. Mecella and W. Penzo}, url = {http://www.isgroup.unimore.it/article/itbam12.pdf}, year = {2012}, date = {2012-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 3rd International Conference on Information Technology in Bio- and Medical Informati, September 2012 (ITBAM 2012)}, pages = {109-123}, address = {Vienna, Austria}, abstract = {Over the last decade, the advances in the high-throughput omic technologies have given the possibility to profile tumor cells at different levels, fostering the discovery of new biological data and the proliferation of a large number of bio-technological databases. In this paper we describe a framework for enabling the interoperability among different biological data sources and for ultimately supporting expert users in the complex process of extraction, navigation and visualization of the precious knowledge hidden in such a huge quantity of data. The system will be used in a pilot study on the Multiple Myeloma (MM).}, keywords = {Biological data, Data Sharing}, pubstate = {published}, tppubtype = {inproceedings} } Over the last decade, the advances in the high-throughput omic technologies have given the possibility to profile tumor cells at different levels, fostering the discovery of new biological data and the proliferation of a large number of bio-technological databases. In this paper we describe a framework for enabling the interoperability among different biological data sources and for ultimately supporting expert users in the complex process of extraction, navigation and visualization of the precious knowledge hidden in such a huge quantity of data. The system will be used in a pilot study on the Multiple Myeloma (MM). |
93. | F. Grandi, F. Mandreoli, R. Martoglia (2012): Efficient management of multi-version clinical guidelines. Journal of Biomedical Informatics (JBI), 45 (6), 2012. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search, Data Versioning) @article{pub100, title = {Efficient management of multi-version clinical guidelines}, author = {F. Grandi and F. Mandreoli and R. Martoglia}, year = {2012}, date = {2012-01-01}, urldate = {2013-06-12}, journal = {Journal of Biomedical Informatics (JBI)}, volume = {45}, number = {6}, abstract = {Clinical medicine and health-care developments in recent years testified a tremendous increase in the number of available guidelines, i.e., best practices encoding and standardizing care procedures for a given disease. Clinical guidelines are subject to continuous development and revision by committees of expert physicians and health authorities and, thus, multiple versions coexist as a consequence of the clinical and healthcare activities. Moreover, several alternatives are usually included in order to make the guidelines as general as possible, making them difficult to handle both in manual and automated fashions. In this work, we will introduce techniques to model and to provide efficient personalized access to very large collections of multi-version clinical guidelines, which can be stored both in textual and in executable format in an XML repository. In this way, multiple temporal perspectives, patient profile and context information can be used by an automated personalization service to efficiently build on demand a guideline version tailored to a specific use case.}, keywords = {Approximate search, Data Versioning}, pubstate = {published}, tppubtype = {article} } Clinical medicine and health-care developments in recent years testified a tremendous increase in the number of available guidelines, i.e., best practices encoding and standardizing care procedures for a given disease. Clinical guidelines are subject to continuous development and revision by committees of expert physicians and health authorities and, thus, multiple versions coexist as a consequence of the clinical and healthcare activities. Moreover, several alternatives are usually included in order to make the guidelines as general as possible, making them difficult to handle both in manual and automated fashions. In this work, we will introduce techniques to model and to provide efficient personalized access to very large collections of multi-version clinical guidelines, which can be stored both in textual and in executable format in an XML repository. In this way, multiple temporal perspectives, patient profile and context information can be used by an automated personalization service to efficiently build on demand a guideline version tailored to a specific use case. |
92. | M. Ceci, M. Coluccia, F. Fumarola, P. H. Guzzi, F. Mandreoli, R. Martoglia, E. Masciari, M. Mecella, W. Penzo (2012): The IS-BioBank Project: A Framework for Biological Data Normalization, Interoperability, and Mining for Cancer Microenvironment Analysis. SIGHIT Record (SIGHIT), 2 (2), pp. 16-21, 2012. (Type: Journal Article | Abstract | BibTeX | Tags: Biological data, Data Sharing) @article{pub101, title = {The IS-BioBank Project: A Framework for Biological Data Normalization, Interoperability, and Mining for Cancer Microenvironment Analysis}, author = {M. Ceci and M. Coluccia and F. Fumarola and P. H. Guzzi and F. Mandreoli and R. Martoglia and E. Masciari and M. Mecella and W. Penzo}, year = {2012}, date = {2012-01-01}, urldate = {2013-06-12}, journal = {SIGHIT Record (SIGHIT)}, volume = {2}, number = {2}, pages = {16-21}, abstract = {Advances of high throughput technologies have yielded the possibility to investigate human cells of healthy and morbid ones at different levels. Consequently, this has made possible the discovery of new biological and biomedical data and the proliferation of a large number of databases. In this paper, we describe the IS-BioBank (Integrated Semantic Biological Data Bank) proposal. It consists of the realization of a framework for enabling the interoperability among different biological data sources and for ultimately supporting expert users in the complex process of extraction, navigation and visualization of the precious knowledge hidden in such a huge quantity of data. In this framework, a key role has been played by the Connectivity Map, a databank which relates diseases, physiological processes, and the action of drugs. The system will be used in a pilot study on the Multiple Myeloma (MM).}, keywords = {Biological data, Data Sharing}, pubstate = {published}, tppubtype = {article} } Advances of high throughput technologies have yielded the possibility to investigate human cells of healthy and morbid ones at different levels. Consequently, this has made possible the discovery of new biological and biomedical data and the proliferation of a large number of databases. In this paper, we describe the IS-BioBank (Integrated Semantic Biological Data Bank) proposal. It consists of the realization of a framework for enabling the interoperability among different biological data sources and for ultimately supporting expert users in the complex process of extraction, navigation and visualization of the precious knowledge hidden in such a huge quantity of data. In this framework, a key role has been played by the Connectivity Map, a databank which relates diseases, physiological processes, and the action of drugs. The system will be used in a pilot study on the Multiple Myeloma (MM). |
91. | L. Carafoli (2012): Data Management in a Modern ITS: Problems and Solutions. Conceptual Modeling - 31st International Conference ER 2012, pp. 584-589, 2012. (Type: Inproceeding | BibTeX | Tags: Data Stream, ITS Data Management, Pegasus project) @inproceedings{DBLP:conf/er/Carafoli12, title = {Data Management in a Modern ITS: Problems and Solutions}, author = {L. Carafoli}, year = {2012}, date = {2012-10-23}, urldate = {2013-07-09}, booktitle = {Conceptual Modeling - 31st International Conference ER 2012}, pages = {584-589}, keywords = {Data Stream, ITS Data Management, Pegasus project}, pubstate = {published}, tppubtype = {inproceedings} } |
2011 | |
90. | R. Lenzi, C. Gennaro, F. Mandreoli, R. Martoglia, M. Mordacchini, W. Penzo, S. Sassatelli (2011): A Unified Multimedia and Semantic Perspective for Data Retrieval in the Semantic Web. Information Systems (Information), 36 (2), pp. 174-191, 2011. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @article{pub82, title = {A Unified Multimedia and Semantic Perspective for Data Retrieval in the Semantic Web}, author = {R. Lenzi and C. Gennaro and F. Mandreoli and R. Martoglia and M. Mordacchini and W. Penzo and S. Sassatelli}, year = {2011}, date = {2011-01-01}, urldate = {2013-06-12}, journal = {Information Systems (Information)}, volume = {36}, number = {2}, pages = {174-191}, abstract = {In recent years, the emerging diffusion of peer-to-peer networks is going beyond the single-domain paradigm like, for instance, the mono-thematic file sharing one (e.g., Napster for music). Peers are more and more heterogeneous data sources which need to share data with commercial, educational, and/or collaboration purposes, just to mention a few. Moreover, in current information processing applications data can not be meaningfully searched by precise database queries that would return exact matches (e.g., when dealing with multimedia, proteomic, statistical data). In this paper we move a step towards multi-domain multi-type data sharing systems by introducing an advanced technological infrastructure which enables users to meet these new emerging needs. A fundamental issue in this context is data heterogeneity, which is pervasive and intrinsically present both at intensional level where, due to peers\' autonomy, different semantic descriptions of the available information are provided, and at extensional level, where multiple data types can coexist, also including content-based searchable data types such as multimedia data. Our proposal relies on a Peer Data Management Systems (PDMS) framework to present innovative network organization and query routing mechanisms which exploit both peers\' data description and data content to achieve effective and efficient network management and data retrieval in such a context. The validity of our proposal is demonstrated by an absolutely satisfactory experimental evaluation on a real setting.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {article} } In recent years, the emerging diffusion of peer-to-peer networks is going beyond the single-domain paradigm like, for instance, the mono-thematic file sharing one (e.g., Napster for music). Peers are more and more heterogeneous data sources which need to share data with commercial, educational, and/or collaboration purposes, just to mention a few. Moreover, in current information processing applications data can not be meaningfully searched by precise database queries that would return exact matches (e.g., when dealing with multimedia, proteomic, statistical data). In this paper we move a step towards multi-domain multi-type data sharing systems by introducing an advanced technological infrastructure which enables users to meet these new emerging needs. A fundamental issue in this context is data heterogeneity, which is pervasive and intrinsically present both at intensional level where, due to peers' autonomy, different semantic descriptions of the available information are provided, and at extensional level, where multiple data types can coexist, also including content-based searchable data types such as multimedia data. Our proposal relies on a Peer Data Management Systems (PDMS) framework to present innovative network organization and query routing mechanisms which exploit both peers' data description and data content to achieve effective and efficient network management and data retrieval in such a context. The validity of our proposal is demonstrated by an absolutely satisfactory experimental evaluation on a real setting. |
89. | F. Mandreoli, R. Martoglia (2011): Knowledge-Based Sense Disambiguation (Almost) For All Structures. Information Systems (Information), 36 (2), pp. 406-430, 2011. (Type: Journal Article | Abstract | BibTeX | Tags: NeP4B Project, Structural Disambiguation) @article{pub86, title = {Knowledge-Based Sense Disambiguation (Almost) For All Structures}, author = {F. Mandreoli and R. Martoglia}, year = {2011}, date = {2011-01-01}, urldate = {2013-06-12}, journal = {Information Systems (Information)}, volume = {36}, number = {2}, pages = {406-430}, abstract = {Structural disambiguation is acknowledged as a very real and frequent problem for many semantic-aware applications. In this paper, we propose a unified answer to sense disambiguation on a large variety of structures both at data and metadata level such as relational schemas, XML data and schemas, taxonomies, and ontologies. Our knowledge-based approach achieves general applicability by converting the input structures into a common format and by allowing users to tailor the extraction of the context to the specific application needs and structure characteristics. Flexibility is ensured by supporting the combination of different disambiguation methods together with different information extracted from different sources of knowledge. Further, we support both assisted and completely automatic semantic annotation tasks, while several novel feedback techniques allow us to improve the initial disambiguation results without necessarily requiring user intervention. An extensive evaluation of the obtained results shows the good effectiveness of the proposed solutions on a large variety of structure-based information and disambiguation requirements.}, keywords = {NeP4B Project, Structural Disambiguation}, pubstate = {published}, tppubtype = {article} } Structural disambiguation is acknowledged as a very real and frequent problem for many semantic-aware applications. In this paper, we propose a unified answer to sense disambiguation on a large variety of structures both at data and metadata level such as relational schemas, XML data and schemas, taxonomies, and ontologies. Our knowledge-based approach achieves general applicability by converting the input structures into a common format and by allowing users to tailor the extraction of the context to the specific application needs and structure characteristics. Flexibility is ensured by supporting the combination of different disambiguation methods together with different information extracted from different sources of knowledge. Further, we support both assisted and completely automatic semantic annotation tasks, while several novel feedback techniques allow us to improve the initial disambiguation results without necessarily requiring user intervention. An extensive evaluation of the obtained results shows the good effectiveness of the proposed solutions on a large variety of structure-based information and disambiguation requirements. |
88. | R. Cucchiara, M. Fornaciari, R. Haider, F. Mandreoli, R. Martoglia, A. Prati, S. Sassatelli (2011): A Reasoning Engine for Intruders' Localization in Wide Open Areas using a Network of Cameras and RFIDs. Proceedings of the 1st IEEE Workshop on Camera Networks and Wide Area Scene Analysis, June 2011 (IEEE WCNWASA 2011), Colorado Springs, USA, 2011. (Type: Inproceeding | Abstract | BibTeX | Tags: Data Stream, Outdoor Video Protection Project) @inproceedings{pub89, title = {A Reasoning Engine for Intruders\' Localization in Wide Open Areas using a Network of Cameras and RFIDs}, author = {R. Cucchiara and M. Fornaciari and R. Haider and F. Mandreoli and R. Martoglia and A. Prati and S. Sassatelli}, year = {2011}, date = {2011-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 1st IEEE Workshop on Camera Networks and Wide Area Scene Analysis, June 2011 (IEEE WCNWASA 2011)}, address = {Colorado Springs, USA}, abstract = {Wide open areas represent challenging scenarios for surveillance systems, especially when sensors are affected by noise, uncertainty, distractors and complex scenarios. Therefore, the tasks of localizing and identifying targets (e.g., people) in such environments require to go beyond the use of camera-only deployments. In this paper, we propose an innovative system for wide open area intruder detection relying on the joint use of cameras and RFIDs, allowing us to map RFID tags to people detected by cameras and, thus, highlighting potential intruders. To this end, sophisticated filtering techniques preserve the uncertainty of data and overcome the heterogeneity of sensors, while an evidential fusion architecture, based on Transferable Belief Model, combines the two sources of information and manages conflict between them. The conducted experimental evaluation shows promising results.}, keywords = {Data Stream, Outdoor Video Protection Project}, pubstate = {published}, tppubtype = {inproceedings} } Wide open areas represent challenging scenarios for surveillance systems, especially when sensors are affected by noise, uncertainty, distractors and complex scenarios. Therefore, the tasks of localizing and identifying targets (e.g., people) in such environments require to go beyond the use of camera-only deployments. In this paper, we propose an innovative system for wide open area intruder detection relying on the joint use of cameras and RFIDs, allowing us to map RFID tags to people detected by cameras and, thus, highlighting potential intruders. To this end, sophisticated filtering techniques preserve the uncertainty of data and overcome the heterogeneity of sensors, while an evidential fusion architecture, based on Transferable Belief Model, combines the two sources of information and manages conflict between them. The conducted experimental evaluation shows promising results. |
87. | R. Martoglia (2011): Facilitate IT-Providing SMEs in Software Development: a Semantic Helper for Filtering and Searching Knowledge. Proceedings of the 23rd International Conference on Software Engineering and Knowledge Engineering, July 2011 (SEKE 2011), pp. 130-136, Miami Beach, USA, 2011. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, FACIT Project, SE Knowledge Management) @inproceedings{pub90, title = {Facilitate IT-Providing SMEs in Software Development: a Semantic Helper for Filtering and Searching Knowledge}, author = {R. Martoglia}, url = {http://www.isgroup.unimore.it/article/seke11.pdf}, year = {2011}, date = {2011-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 23rd International Conference on Software Engineering and Knowledge Engineering, July 2011 (SEKE 2011)}, pages = {130-136}, address = {Miami Beach, USA}, abstract = {Software development is still considered a bottleneck in the advance of the Information Society. The recently started FACIT-SME European FP-7 project targets to facilitate the use and sharing of Software Engineering methods and best practices among software developing SMEs. On top of an Open Reference Model (ORM) serving as an underlying knowledge backbone, specific filtering/search mechanisms will support the identification of adequate processes and practices for specific enterprise needs. In this paper, we focus on the proposal of knowledge-based text analysis and retrieval techniques which will form a key component of the advanced filtering mechanisms of the project. The proposed solution is designed to be more powerful and flexible than standard syntactic search techniques, but also to be easily applicable for any SME. The preliminary experimental evaluation shows promising results. The present work is partially supported by the \'Facilitate IT-providing SMEs by Operation-related Models and Methods (FACIT-SME)\' project.}, keywords = {Approximate search, FACIT Project, SE Knowledge Management}, pubstate = {published}, tppubtype = {inproceedings} } Software development is still considered a bottleneck in the advance of the Information Society. The recently started FACIT-SME European FP-7 project targets to facilitate the use and sharing of Software Engineering methods and best practices among software developing SMEs. On top of an Open Reference Model (ORM) serving as an underlying knowledge backbone, specific filtering/search mechanisms will support the identification of adequate processes and practices for specific enterprise needs. In this paper, we focus on the proposal of knowledge-based text analysis and retrieval techniques which will form a key component of the advanced filtering mechanisms of the project. The proposed solution is designed to be more powerful and flexible than standard syntactic search techniques, but also to be easily applicable for any SME. The preliminary experimental evaluation shows promising results. The present work is partially supported by the 'Facilitate IT-providing SMEs by Operation-related Models and Methods (FACIT-SME)' project. |
86. | F. Mandreoli, W. Penzo, S. Rizzi, M. Golfarelli, E. Turricchia (2011): BIN: Business Intelligence Networks. Business Intelligence Applications and the Web: Models, Systems and Technologies, pp. 244-265, 2011. (Type: Incollection | Abstract | BibTeX | Tags: BIN, Data Sharing) @incollection{pub92, title = {BIN: Business Intelligence Networks}, author = {F. Mandreoli and W. Penzo and S. Rizzi and M. Golfarelli and E. Turricchia}, year = {2011}, date = {2011-01-01}, urldate = {2013-06-12}, booktitle = {Business Intelligence Applications and the Web: Models, Systems and Technologies}, pages = {244-265}, abstract = {Cooperation is seen by companies as one of the major means for increasing flexibility and innovating. Business intelligence (BI) platforms are aimed at serving individual companies, and they cannot operate over networks of companies characterized by an organizational, lexical, and semantic heterogeneity. In this chapter we propose a framework, called Business Intelligence Network (BIN), for sharing BI functionalities over complex networks of companies that are chasing mutual advantages through the sharing of strategic information. A BIN is based on a network of peers, one for each company participating in the consortium. Peers are equipped with independent BI platforms that expose some querying functionalities aimed at sharing business information for the decision-making process. After proposing an architecture for a BIN, we outline the main research issues involved in its building and operating, and we focus on the definition of an ad hoc language for expressing semantic mappings between the multidimensional schemata owned by the different peers, aimed at enabling query reformulation over the network.}, keywords = {BIN, Data Sharing}, pubstate = {published}, tppubtype = {incollection} } Cooperation is seen by companies as one of the major means for increasing flexibility and innovating. Business intelligence (BI) platforms are aimed at serving individual companies, and they cannot operate over networks of companies characterized by an organizational, lexical, and semantic heterogeneity. In this chapter we propose a framework, called Business Intelligence Network (BIN), for sharing BI functionalities over complex networks of companies that are chasing mutual advantages through the sharing of strategic information. A BIN is based on a network of peers, one for each company participating in the consortium. Peers are equipped with independent BI platforms that expose some querying functionalities aimed at sharing business information for the decision-making process. After proposing an architecture for a BIN, we outline the main research issues involved in its building and operating, and we focus on the definition of an ad hoc language for expressing semantic mappings between the multidimensional schemata owned by the different peers, aimed at enabling query reformulation over the network. |
85. | F. Mandreoli, R. Haider, A. Prati, M. Fornaciari, R. Cucchiara (2011): Identification of intruders in groups of people using cameras and RFIDs. Fifth ACM/IEEE International Conference on Distributed Smart Camer, August 2011 (ICDSC 2011), pp. 1-6, Ghent, Belgium, 2011. (Type: Inproceeding | Abstract | BibTeX | Tags: Sensors and RFIDs) @inproceedings{pub94, title = {Identification of intruders in groups of people using cameras and RFIDs}, author = {F. Mandreoli and R. Haider and A. Prati and M. Fornaciari and R. Cucchiara}, year = {2011}, date = {2011-01-01}, urldate = {2013-06-12}, booktitle = {Fifth ACM/IEEE International Conference on Distributed Smart Camer, August 2011 (ICDSC 2011)}, pages = {1-6}, address = {Ghent, Belgium}, abstract = {The identification of intruders in groups of people moving in wide open areas represents a challenging scenario where coordination between cameras can be certainly used but this solution is not enough. In this paper, we propose to go beyond pure vision-based approaches by integrating the use of distributed cameras with the RFID technology. To this end, we introduce a system that maps RFID tags to people detected by cameras by using sophisticated techniques to filter the singular modalities and an evidential fusion architecture, based on Transferable Belief Model, to combine the two sources of information and manage conflict between them. The conducted experimental evaluation shows very promising results, especially in treating groups of people.}, keywords = {Sensors and RFIDs}, pubstate = {published}, tppubtype = {inproceedings} } The identification of intruders in groups of people moving in wide open areas represents a challenging scenario where coordination between cameras can be certainly used but this solution is not enough. In this paper, we propose to go beyond pure vision-based approaches by integrating the use of distributed cameras with the RFID technology. To this end, we introduce a system that maps RFID tags to people detected by cameras by using sophisticated techniques to filter the singular modalities and an evidential fusion architecture, based on Transferable Belief Model, to combine the two sources of information and manage conflict between them. The conducted experimental evaluation shows very promising results, especially in treating groups of people. |
2010 | |
84. | F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli (2010): Data Management Issues for Intelligent Transportation Systems. Proceedings of the 18th Italian Symposium on Advanced Database Systems, June 2010 (SEBD 2010), pp. 198-209, Rimini (RN), Italy, 2010. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Stream, ITS Data Management, Pegasus project) @inproceedings{pub80, title = {Data Management Issues for Intelligent Transportation Systems}, author = {F. Mandreoli and R. Martoglia and W. Penzo and S. Sassatelli}, url = {http://www.isgroup.unimore.it/article/sebd10.pdf}, year = {2010}, date = {2010-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 18th Italian Symposium on Advanced Database Systems, June 2010 (SEBD 2010)}, pages = {198-209}, address = {Rimini (RN), Italy}, abstract = {In this paper we discuss the technical challenges of devising a Data Stream Management System (DSMS) in the intelligent transportation scenario considered in the PEGASUS project, where the final aim is to provide reliable and timely information to improve the safety and the efficiency of vehicles\' and goods\' flows. The system should collect and integrate the large amounts of geo-located stream items coming from On Board Units (OBUs) installed on vehicles, with the aim of producing real-time maps including traffic and Points Of Interest (POIs) information to be then distributed to OBUs. OBUs\' smart navigation engines will exploit these maps to enhance mobility and provide user-targeted information. We propose a two-tiered GIS DSMS architecture where stream items are pulled from the source input stream, processed and stored in a result container to be further pulled by other operators. The system reduces the data acquisition costs by adopting communication-saving policies, supports ad-hoc strategies for reducing the storage management costs (lowering response times and memory consumption), and provides the required data access functionalities through an SQL-like query language enhanced with stream, event, spatial and temporal operators. OBU stream items are also exploited to detect Events Of Interest (EOIs) such as jams and accidents and to support a collaborative mechanism for user-powered POI management and rating. EOIs and POIs are modeled through specific ontologies which allow for a flexible and extensible data management and guarantee data independence from the raw streams.}, keywords = {Data Stream, ITS Data Management, Pegasus project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we discuss the technical challenges of devising a Data Stream Management System (DSMS) in the intelligent transportation scenario considered in the PEGASUS project, where the final aim is to provide reliable and timely information to improve the safety and the efficiency of vehicles' and goods' flows. The system should collect and integrate the large amounts of geo-located stream items coming from On Board Units (OBUs) installed on vehicles, with the aim of producing real-time maps including traffic and Points Of Interest (POIs) information to be then distributed to OBUs. OBUs' smart navigation engines will exploit these maps to enhance mobility and provide user-targeted information. We propose a two-tiered GIS DSMS architecture where stream items are pulled from the source input stream, processed and stored in a result container to be further pulled by other operators. The system reduces the data acquisition costs by adopting communication-saving policies, supports ad-hoc strategies for reducing the storage management costs (lowering response times and memory consumption), and provides the required data access functionalities through an SQL-like query language enhanced with stream, event, spatial and temporal operators. OBU stream items are also exploited to detect Events Of Interest (EOIs) such as jams and accidents and to support a collaborative mechanism for user-powered POI management and rating. EOIs and POIs are modeled through specific ontologies which allow for a flexible and extensible data management and guarantee data independence from the raw streams. |
83. | F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli, G. Villani (2010): Leveraging Semantic Approximations in Heterogeneous XML Data Sharing Networks: The SUNRISE Approach.. Soft Computing in XML Data Management, Zongmin Ma and Li Yan (Eds.), Springer, 2010. (Type: Incollection | Abstract | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @incollection{pub81, title = { Leveraging Semantic Approximations in Heterogeneous XML Data Sharing Networks: The SUNRISE Approach.}, author = {F. Mandreoli and R. Martoglia and W. Penzo and S. Sassatelli and G. Villani}, year = {2010}, date = {2010-01-01}, urldate = {2013-06-12}, booktitle = {Soft Computing in XML Data Management, Zongmin Ma and Li Yan (Eds.), Springer}, abstract = {In recent years, the huge amount of data available from Internet information sources has focused much attention on the sharing of distributed information through P2P and, in line with the Semantic Web vision, through Peer Data Management Systems (PDMSs). On the other hand, XML is with no doubt the most popular data representation and exchange format on the Web and more and more Internet applications are conforming to this de facto standard for data sharing. In this chapter we present SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration) for XML data sharing. SUNRISE is a complete PDMS infrastructure aiming at semantic interoperability in heterogeneous networks. Decentralized data sharing is supported by a set of autonomous peers which model their local data through schemas and which are locally connected through semantic mappings. SUNRISE leverages the semantic approximations originating from schemas\' heterogeneity for an effective and efficient organization and exploration of the network. For these purposes, SUNRISE implements soft computing techniques which cluster peers in Semantic Overlay Networks according to their own contents, and promote the routing of queries towards the semantically best directions in the network.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {incollection} } In recent years, the huge amount of data available from Internet information sources has focused much attention on the sharing of distributed information through P2P and, in line with the Semantic Web vision, through Peer Data Management Systems (PDMSs). On the other hand, XML is with no doubt the most popular data representation and exchange format on the Web and more and more Internet applications are conforming to this de facto standard for data sharing. In this chapter we present SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration) for XML data sharing. SUNRISE is a complete PDMS infrastructure aiming at semantic interoperability in heterogeneous networks. Decentralized data sharing is supported by a set of autonomous peers which model their local data through schemas and which are locally connected through semantic mappings. SUNRISE leverages the semantic approximations originating from schemas' heterogeneity for an effective and efficient organization and exploration of the network. For these purposes, SUNRISE implements soft computing techniques which cluster peers in Semantic Overlay Networks according to their own contents, and promote the routing of queries towards the semantically best directions in the network. |
82. | R. Martoglia (2010): Information Retrieval Techniques for Pattern Matching - Managing and Searching Textual and XML Information in 21st Century Applications. LAP Publishing, 2010, ISBN: 978-3838372532. (Type: Book | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, Data Versioning, EBMT, Structural Disambiguation, Twig Query Processing) @book{pub83, title = {Information Retrieval Techniques for Pattern Matching - Managing and Searching Textual and XML Information in 21st Century Applications}, author = {R. Martoglia}, url = {http://www.isgroup.unimore.it/article/buchpreview.pdf}, isbn = {978-3838372532}, year = {2010}, date = {2010-01-01}, urldate = {2013-06-12}, publisher = {LAP Publishing}, abstract = {Information is the main value of Information Society. The recent developments in computing power and telecommunications, along with the constant drop of Internet access costs and data management and storing, created the right conditions for the global diffusion of the Web and, more generally, of new research tools able to analyze information and their contents. Depending on the particular application scenario and on the type of information that has to be managed and searched, different techniques need to be devised. In this book, the author deals with the two most common types of information: plain text, discussed in the first part, and semi-structured data, in particular XML documents, deeply discussed the second part. The detailed analysis of approximate matching, duplicate document detection, exact, approximate and semantic query answering, multi-version document management and personalized access techniques offered in this book will guide Information Technology professionals and users in effectively and efficiently managing information and knowledge, thus answering the increasingly complex Information needs of most 21st century applications.}, keywords = {Approximate search, Data Sharing, Data Versioning, EBMT, Structural Disambiguation, Twig Query Processing}, pubstate = {published}, tppubtype = {book} } Information is the main value of Information Society. The recent developments in computing power and telecommunications, along with the constant drop of Internet access costs and data management and storing, created the right conditions for the global diffusion of the Web and, more generally, of new research tools able to analyze information and their contents. Depending on the particular application scenario and on the type of information that has to be managed and searched, different techniques need to be devised. In this book, the author deals with the two most common types of information: plain text, discussed in the first part, and semi-structured data, in particular XML documents, deeply discussed the second part. The detailed analysis of approximate matching, duplicate document detection, exact, approximate and semantic query answering, multi-version document management and personalized access techniques offered in this book will guide Information Technology professionals and users in effectively and efficiently managing information and knowledge, thus answering the increasingly complex Information needs of most 21st century applications. |
81. | R. Martoglia, S. Bergamaschi, S. Lodi, C. Sartori (2010): Proceedings of the Eighteenth Italian Symposium on Advanced Database Systems. Esculapio Editore, 2010, ISBN: 978-88-7488-369-1. (Type: Book | Abstract | Links | BibTeX | Tags: ) @book{pub85, title = {Proceedings of the Eighteenth Italian Symposium on Advanced Database Systems}, author = {R. Martoglia and S. Bergamaschi and S. Lodi and C. Sartori}, url = {http://www.isgroup.unimore.it/article/sebd10-frontmatter.pdf}, isbn = {978-88-7488-369-1}, year = {2010}, date = {2010-01-01}, urldate = {2013-06-12}, publisher = {Esculapio Editore}, abstract = {This volume collects the papers selected for presentation at the Eighteenth Italian Symposium on Advanced Database Systems (SEBD 2010), held in Rimini, Italy, from the 20th to the 23rd of June 2010. SEBD is the major annual event of the Italian database research community. The symposium is conceived as a gathering forum for the discussion and exchange of ideas and experiences among researchers and experts from the academy and industry, about all aspects of database systems and their applications.}, keywords = {}, pubstate = {published}, tppubtype = {book} } This volume collects the papers selected for presentation at the Eighteenth Italian Symposium on Advanced Database Systems (SEBD 2010), held in Rimini, Italy, from the 20th to the 23rd of June 2010. SEBD is the major annual event of the Italian database research community. The symposium is conceived as a gathering forum for the discussion and exchange of ideas and experiences among researchers and experts from the academy and industry, about all aspects of database systems and their applications. |
80. | F. Mandreoli, R. Martoglia, S. Sassatelli, P. Tiberio, W. Penzo, C. Gennaro, M. Mordacchini, S. Orlando (2010): Toward an Effective and Efficient Query Processing in the NeP4B Project. Information Systems: People, Organizations, Institutions, and Technologies, 2010. (Type: Incollection | Abstract | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @incollection{pub87, title = {Toward an Effective and Efficient Query Processing in the NeP4B Project}, author = {F. Mandreoli and R. Martoglia and S. Sassatelli and P. Tiberio and W. Penzo and C. Gennaro and M. Mordacchini and S. Orlando}, year = {2010}, date = {2010-01-01}, urldate = {2013-06-12}, booktitle = {Information Systems: People, Organizations, Institutions, and Technologies}, abstract = {In this paper we present our main current research activity in the Italian co-funded FIRB Project NeP4B (Networked Peers for Business). In particular, we provide an overview of our P2P query routing approach which combines semantics and multimedia aspects in order to make query processing effective and efficient.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {incollection} } In this paper we present our main current research activity in the Italian co-funded FIRB Project NeP4B (Networked Peers for Business). In particular, we provide an overview of our P2P query routing approach which combines semantics and multimedia aspects in order to make query processing effective and efficient. |
79. | R. Haider, F. Mandreoli, R. Martoglia, S. Sassatelli, P. Tiberio (2010): Toward a Flexible Data Management Middleware for Wireless Sensor Networks. Management of the Inteconnected World, 2010. (Type: Incollection | Abstract | BibTeX | Tags: Data Stream, Outdoor Video Protection Project, Radio Sensors Project) @incollection{pub88, title = {Toward a Flexible Data Management Middleware for Wireless Sensor Networks}, author = {R. Haider and F. Mandreoli and R. Martoglia and S. Sassatelli and P. Tiberio}, year = {2010}, date = {2010-01-01}, urldate = {2013-06-12}, booktitle = {Management of the Inteconnected World}, abstract = {In this paper we present the research activity we are carrying out in the \"\"Mobile Semantic Self-Organizing Wireless Sensor Networks\"\" Project at the Department of Information Engineering of the University of Modena and Reggio Emilia. In this context, the main aim of our research is to study solutions for the flexible querying of distributed data collected by heterogeneous devices providing measurement readings. To this end, we propose a middleware for wireless sensor networks which is able to autonomously configure the communication and the operations required to each device in order to reduce energy and temporal costs.}, keywords = {Data Stream, Outdoor Video Protection Project, Radio Sensors Project}, pubstate = {published}, tppubtype = {incollection} } In this paper we present the research activity we are carrying out in the ""Mobile Semantic Self-Organizing Wireless Sensor Networks"" Project at the Department of Information Engineering of the University of Modena and Reggio Emilia. In this context, the main aim of our research is to study solutions for the flexible querying of distributed data collected by heterogeneous devices providing measurement readings. To this end, we propose a middleware for wireless sensor networks which is able to autonomously configure the communication and the operations required to each device in order to reduce energy and temporal costs. |
78. | F. Mandreoli, W. Penzo, S. Rizzi, M. Golfarelli, E. Turricchia (2010): Towards OLAP Query Reformulation in Peer-to-Peer Data Warehousing. Proceedings 13th International Workshop on Data Warehousing and OL, November 2010 (DOLAP 2010), pp. 37-44, Toronto, Canada, 2010. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: BIN, Data Sharing) @inproceedings{pub91, title = {Towards OLAP Query Reformulation in Peer-to-Peer Data Warehousing}, author = {F. Mandreoli and W. Penzo and S. Rizzi and M. Golfarelli and E. Turricchia}, url = {http://www.isgroup.unimore.it/article/dolap11.pdf}, year = {2010}, date = {2010-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings 13th International Workshop on Data Warehousing and OL, November 2010 (DOLAP 2010)}, pages = {37-44}, address = {Toronto, Canada}, abstract = {Inter-business collaborative contexts prefigure a distributed scenario where companies organize and coordinate themselves to develop common and shared opportunities. Traditional business intelligence systems do not provide support to this end. Peer Data Management Systems (PDMSs) have been proposed as architectures to support sharing of operational data across networks of peers while guaranteeing peers\' autonomy, based on semantic mappings that mediate between the heterogeneous schemata exposed by peers. In line with the PDMS infrastructure, in this paper we envision a peer-to-peer data warehousing architecture based on a network of heterogeneous peers, each exposing query answering functionalities aimed at sharing business information. To enhance the decision making process, an OLAP query expressed on a peer needs be properly reformulated on the other peers. In this direction, we present a language for the definition of mappings between the multidimensional schemata of peers, and we introduce a query reformulation framework that relies on the translation of these mappings towards relational schemata. Finally, we sketch the query reformulation algorithm by outlining the reformulation steps of typical OLAP queries.}, keywords = {BIN, Data Sharing}, pubstate = {published}, tppubtype = {inproceedings} } Inter-business collaborative contexts prefigure a distributed scenario where companies organize and coordinate themselves to develop common and shared opportunities. Traditional business intelligence systems do not provide support to this end. Peer Data Management Systems (PDMSs) have been proposed as architectures to support sharing of operational data across networks of peers while guaranteeing peers' autonomy, based on semantic mappings that mediate between the heterogeneous schemata exposed by peers. In line with the PDMS infrastructure, in this paper we envision a peer-to-peer data warehousing architecture based on a network of heterogeneous peers, each exposing query answering functionalities aimed at sharing business information. To enhance the decision making process, an OLAP query expressed on a peer needs be properly reformulated on the other peers. In this direction, we present a language for the definition of mappings between the multidimensional schemata of peers, and we introduce a query reformulation framework that relies on the translation of these mappings towards relational schemata. Finally, we sketch the query reformulation algorithm by outlining the reformulation steps of typical OLAP queries. |
2009 | |
77. | F. Mandreoli, R. Martoglia, E. Ronchetti (2009): Native Temporal Slicing Support for XML Databases. The Open Information Science Journal (TOISJ,), 2 (1), pp. 2-9, 2009. (Type: Journal Article | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @article{pub68, title = {Native Temporal Slicing Support for XML Databases}, author = {F. Mandreoli and R. Martoglia and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/toisj09.pdf}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, journal = {The Open Information Science Journal (TOISJ,)}, volume = {2}, number = {1}, pages = {2-9}, abstract = {XML databases, providing structural querying support, are becoming more and more popular. As we know, XML data may change over time and providing an efficient support to queries which also involve temporal aspects is still an open issue. In this paper we present our native Temporal XML Query Processor, which exploits an ad-hoc temporal indexing scheme relying on relational approaches and a technology supporting temporal slicing. As we show through an extensive experimental evaluation, our solution achieves good efficiency results, outperforming stratum-based solutions when dealing with time-related application requirements while continuing to guarantee good performance in traditional scenarios.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {article} } XML databases, providing structural querying support, are becoming more and more popular. As we know, XML data may change over time and providing an efficient support to queries which also involve temporal aspects is still an open issue. In this paper we present our native Temporal XML Query Processor, which exploits an ad-hoc temporal indexing scheme relying on relational approaches and a technology supporting temporal slicing. As we show through an extensive experimental evaluation, our solution achieves good efficiency results, outperforming stratum-based solutions when dealing with time-related application requirements while continuing to guarantee good performance in traditional scenarios. |
76. | F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli (2009): Data-Sharing P2P Networks with Semantic Approximation Capabilities. IEEE Internet Computing (IEEE), 13 (5), pp. 60-70, 2009. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @article{pub69, title = {Data-Sharing P2P Networks with Semantic Approximation Capabilities}, author = {F. Mandreoli and R. Martoglia and W. Penzo and S. Sassatelli}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, journal = {IEEE Internet Computing (IEEE)}, volume = {13}, number = {5}, pages = {60-70}, abstract = {The synergy between peer-to-peer systems and Semantic Web technologies supports large-scale sharing of semantically rich data, usually represented through schemas such as RDF. Because peers rarely share the same vocabulary, the resulting heterogeneity of data representations introduces new challenges for the efficient and effective retrieval of relevant information. The authors leverage the presence of semantic approximations between peers\' schemas to improve query routing by identifying the peers that best satisfy the user\'s requests, and to inform users of the relevance of the returned answers through a ranking mechanism that promotes the most semantically related results.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {article} } The synergy between peer-to-peer systems and Semantic Web technologies supports large-scale sharing of semantically rich data, usually represented through schemas such as RDF. Because peers rarely share the same vocabulary, the resulting heterogeneity of data representations introduces new challenges for the efficient and effective retrieval of relevant information. The authors leverage the presence of semantic approximations between peers' schemas to improve query routing by identifying the peers that best satisfy the user's requests, and to inform users of the relevance of the returned answers through a ranking mechanism that promotes the most semantically related results. |
75. | F. Mandreoli, R. Martoglia, W. Penzo, G. Villani (2009): Flexible Query Answering on Graph-modeled Data. Proceedings of the 12th International Conference on Extending Database Technology, March 2009 (EDBT 2009), pp. 216-227, Saint-Petersburg, Russia, 2009. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, NeP4B Project) @inproceedings{pub72, title = {Flexible Query Answering on Graph-modeled Data}, author = {F. Mandreoli and R. Martoglia and W. Penzo and G. Villani}, url = {http://www.isgroup.unimore.it/article/edbt09.pdf}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 12th International Conference on Extending Database Technology, March 2009 (EDBT 2009)}, pages = {216-227}, address = {Saint-Petersburg, Russia}, abstract = {The largeness and the heterogeneity of most graph-modeled datasets in several database application areas make the query process a real challenge because of the lack of a complete knowledge of the vocabulary used, as well as of the information about the structural relationships between the data. To overcome these problems, flexible query answering capabilitiesare an essential need. In this paper we present a general model for supporting approximate queries on graph-modeled data. Approximation is both on the vocabularies and the structure. The model is general in that it is not bound to a specific graph data model, rather it gracefully accommodates labeled directed/undirected data graphs with labeled/unlabeled edges. The query answering principles underlying the model are not compelled to a specific data graph, instead they are founded on properties inferable from the data model the data graph conforms to. We complement the work with a ranking model to deal with data approximations and with an efficient top-k retrieval algorithm which smartly accesses ad-hoc data structures and generates the most promising answers in an order correlated with the ranking measures. Experimental results prove the good effectiveness and efficiency of our proposal on different real world datasets.}, keywords = {Approximate search, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } The largeness and the heterogeneity of most graph-modeled datasets in several database application areas make the query process a real challenge because of the lack of a complete knowledge of the vocabulary used, as well as of the information about the structural relationships between the data. To overcome these problems, flexible query answering capabilitiesare an essential need. In this paper we present a general model for supporting approximate queries on graph-modeled data. Approximation is both on the vocabularies and the structure. The model is general in that it is not bound to a specific graph data model, rather it gracefully accommodates labeled directed/undirected data graphs with labeled/unlabeled edges. The query answering principles underlying the model are not compelled to a specific data graph, instead they are founded on properties inferable from the data model the data graph conforms to. We complement the work with a ranking model to deal with data approximations and with an efficient top-k retrieval algorithm which smartly accesses ad-hoc data structures and generates the most promising answers in an order correlated with the ranking measures. Experimental results prove the good effectiveness and efficiency of our proposal on different real world datasets. |
74. | F. Mandreoli, R. Martoglia, P. Zezula (2009): Principles of Holism for Sequential Twig Pattern Matching. VLDB Journal (VLDBJ), 18 (6), pp. 1369-1392, 2009. (Type: Journal Article | Abstract | BibTeX | Tags: Twig Query Processing) @article{pub74, title = {Principles of Holism for Sequential Twig Pattern Matching}, author = {F. Mandreoli and R. Martoglia and P. Zezula}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, journal = {VLDB Journal (VLDBJ)}, volume = {18}, number = {6}, pages = {1369-1392}, abstract = {Modern applications face the challenge of dealing with structured and semi-structured data. They have to deal with complex objects, most of them presenting some kind of internal structure, which often forms a hierarchy. Though XML documents are the most known, chemical compounds, CAD drawings, web-sites and many other applications have to deal with similar problems. In such environments, ordered and unordered tree pattern matching are the fundamental search operations. One of the main thrusts of research activities for tree pattern matching is the class of holistic approaches. Their ultimate goal is to evaluate a query twig as a whole by relying on sequential access patterns and non trivial auxiliary storage structures, typically stored in main memory. Based on the pre/post-order ranks of individual tree nodes, we establish strong theoretical bases as a foundation for correct and efficient holistic pattern matching algorithms. In particular, we define and prove sufficient and necessary conditions to minimize the amount of data retained in memory thus introducing a correct and complete framework on which different holistic solutions can be compared. We also show how these rules can be applied for building algorithms for ordered and unordered tree pattern matching. Thanks to the above theoretical achievements, each holistic algorithm gains in efficiency as it is directly implemented on the adopted numbering scheme, avoid expensive matching refinements and keep memory requirements stable. An experimental analysis and comparison with previous approaches confirms the superiority of our approach tested on synthetic as well as real-life data sets.}, keywords = {Twig Query Processing}, pubstate = {published}, tppubtype = {article} } Modern applications face the challenge of dealing with structured and semi-structured data. They have to deal with complex objects, most of them presenting some kind of internal structure, which often forms a hierarchy. Though XML documents are the most known, chemical compounds, CAD drawings, web-sites and many other applications have to deal with similar problems. In such environments, ordered and unordered tree pattern matching are the fundamental search operations. One of the main thrusts of research activities for tree pattern matching is the class of holistic approaches. Their ultimate goal is to evaluate a query twig as a whole by relying on sequential access patterns and non trivial auxiliary storage structures, typically stored in main memory. Based on the pre/post-order ranks of individual tree nodes, we establish strong theoretical bases as a foundation for correct and efficient holistic pattern matching algorithms. In particular, we define and prove sufficient and necessary conditions to minimize the amount of data retained in memory thus introducing a correct and complete framework on which different holistic solutions can be compared. We also show how these rules can be applied for building algorithms for ordered and unordered tree pattern matching. Thanks to the above theoretical achievements, each holistic algorithm gains in efficiency as it is directly implemented on the adopted numbering scheme, avoid expensive matching refinements and keep memory requirements stable. An experimental analysis and comparison with previous approaches confirms the superiority of our approach tested on synthetic as well as real-life data sets. |
73. | F. Mandreoli, R. Martoglia, W. Penzo, G. Villani (2009): Semantics-driven Approximate Query Answering on Graph Databases. Proceedings of the 17th Italian Symposium on Advanced Database Systems, June 2009 (SEBD 2009), pp. 21-28, Camogli (GE), Italy, 2009. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, NeP4B Project) @inproceedings{pub75, title = {Semantics-driven Approximate Query Answering on Graph Databases}, author = {F. Mandreoli and R. Martoglia and W. Penzo and G. Villani}, url = {http://www.isgroup.unimore.it/article/sebd09.pdf}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 17th Italian Symposium on Advanced Database Systems, June 2009 (SEBD 2009)}, pages = {21-28}, address = {Camogli (GE), Italy}, abstract = {Several database application areas need to deal with graph-modeled datasets. The main features of these datasets are the largeness and the heterogeneity of the data, which make it impractical to answer exact queries. In this paper we present our recent research efforts in modeling flexible query answering capabilities in this context. Flexibility is captured by approximations both on the labels and on the structure of graph-based queries, by guaranteeing semantically meaningful relaxations only. In order to cope with the excess of results, we adapt a well-known top-k retrieval algorithm to our context. The good effectiveness and efficiency of our proposal are proved by an extensive experimental evaluation on different real world datasets.}, keywords = {Approximate search, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } Several database application areas need to deal with graph-modeled datasets. The main features of these datasets are the largeness and the heterogeneity of the data, which make it impractical to answer exact queries. In this paper we present our recent research efforts in modeling flexible query answering capabilities in this context. Flexibility is captured by approximations both on the labels and on the structure of graph-based queries, by guaranteeing semantically meaningful relaxations only. In order to cope with the excess of results, we adapt a well-known top-k retrieval algorithm to our context. The good effectiveness and efficiency of our proposal are proved by an extensive experimental evaluation on different real world datasets. |
72. | C. Gennaro, F. Mandreoli, R. Martoglia, M. Mordacchini, W. Penzo, S. Sassatelli (2009): Combining Semantic and Multimedia Query Routing Techniques for Unified Data Retrieval in a PDMS. Proceedings of the 1st International Workshop on Interoperability through Semantic Data and Service Integration, June 2009 (ISDSI 2009), pp. 17-28, Camogli (GE), Italy, 2009. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub76, title = {Combining Semantic and Multimedia Query Routing Techniques for Unified Data Retrieval in a PDMS}, author = {C. Gennaro and F. Mandreoli and R. Martoglia and M. Mordacchini and W. Penzo and S. Sassatelli}, url = {http://www.isgroup.unimore.it/article/isdsi09.pdf}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 1st International Workshop on Interoperability through Semantic Data and Service Integration, June 2009 (ISDSI 2009)}, pages = {17-28}, address = {Camogli (GE), Italy}, abstract = {The NeP4B project aims at the development of an advanced technological infrastructure for data sharing in a network of business partners. In this paper we leverage our distinct experiences on semantic and multimedia query routing, and propose an innovative mechanism for an effective and efficient unified data retrieval of both semantic and multimedia data in the context of the NeP4B project.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } The NeP4B project aims at the development of an advanced technological infrastructure for data sharing in a network of business partners. In this paper we leverage our distinct experiences on semantic and multimedia query routing, and propose an innovative mechanism for an effective and efficient unified data retrieval of both semantic and multimedia data in the context of the NeP4B project. |
71. | R. Martoglia (2009): Shaping Tomorrow Information Management, Today. 2015 Scientific Economic Magazine (2015), 1 (1), pp. 86-91, 2009. (Type: Journal Article | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, Data Stream, Data Versioning, EBMT, Structural Disambiguation, Twig Query Processing) @article{pub77, title = {Shaping Tomorrow Information Management, Today}, author = {R. Martoglia}, url = {http://www.isgroup.unimore.it/article/2015.pdf}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, journal = {2015 Scientific Economic Magazine (2015)}, volume = {1}, number = {1}, pages = {86-91}, abstract = {The recent developments in computing power and telecommunications, and, in general, the advanced ICT (Information and Communication Technology) of the 20th century, accelerated the use and value of Information in our society. Indeed, Information is the main value of Information Society. In this respect, the World Wide Web, Peer-to-Peer networks, mobile devices and ubiquitous computing systems and sensors give us more and more interesting possibilities today; however, current research on the relevant technologies, structures and services is still not enough mature. Research at the Information Systems Group (ISGroup), inside the Information Engineering Department (DII) of the Modena and Reggio Emilia University, is focused on the design and development of new systems, algorithms and data structures for the access and management of Information. The group constantly devises and puts into practice, also by means of national and international research projects and collaborations, innovative solutions able to answer, both effectively and efficiently, increasingly complex Information needs in several 21st century applications. Information is everywhere and comes in many flavours: textual information, multi-lingual information, structural (XML) and multimedia information, multi-version information. Think, for instance, to product descriptions, data sheets, notes, web information, real-time data coming from sensors, etc. What follows is a short presentation of the past and present research activities of the group; thanks to this overview, the reader will have a glimpse of many of the practical applications that benefited from the obtained results, also by possibly investigating them further through the provided references. Most importantly, the key message is that information management, in its many forms, is crucial at every level and in every application scenario; indeed, the following is only an example of what can be achieved. ISGroup will be pleased to be contacted and to take up any type of proposed information management challenge, even through new collaborations or projects.}, keywords = {Approximate search, Data Sharing, Data Stream, Data Versioning, EBMT, Structural Disambiguation, Twig Query Processing}, pubstate = {published}, tppubtype = {article} } The recent developments in computing power and telecommunications, and, in general, the advanced ICT (Information and Communication Technology) of the 20th century, accelerated the use and value of Information in our society. Indeed, Information is the main value of Information Society. In this respect, the World Wide Web, Peer-to-Peer networks, mobile devices and ubiquitous computing systems and sensors give us more and more interesting possibilities today; however, current research on the relevant technologies, structures and services is still not enough mature. Research at the Information Systems Group (ISGroup), inside the Information Engineering Department (DII) of the Modena and Reggio Emilia University, is focused on the design and development of new systems, algorithms and data structures for the access and management of Information. The group constantly devises and puts into practice, also by means of national and international research projects and collaborations, innovative solutions able to answer, both effectively and efficiently, increasingly complex Information needs in several 21st century applications. Information is everywhere and comes in many flavours: textual information, multi-lingual information, structural (XML) and multimedia information, multi-version information. Think, for instance, to product descriptions, data sheets, notes, web information, real-time data coming from sensors, etc. What follows is a short presentation of the past and present research activities of the group; thanks to this overview, the reader will have a glimpse of many of the practical applications that benefited from the obtained results, also by possibly investigating them further through the provided references. Most importantly, the key message is that information management, in its many forms, is crucial at every level and in every application scenario; indeed, the following is only an example of what can be achieved. ISGroup will be pleased to be contacted and to take up any type of proposed information management challenge, even through new collaborations or projects. |
70. | R. Haider, F. Mandreoli, R. Martoglia, S. Sassatelli, P. Tiberio (2009): Toward a Flexible Data Management Middleware for Wireless Sensor Networks. Proceedings of the VI Conference of the Italian Chapter of AIS, October 2009 (itAIS 2009), pp. 165-173, Costa Smeralda, Italy, 2009. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Stream, Radio Sensors Project) @inproceedings{pub78, title = {Toward a Flexible Data Management Middleware for Wireless Sensor Networks}, author = {R. Haider and F. Mandreoli and R. Martoglia and S. Sassatelli and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/itais09.pdf}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the VI Conference of the Italian Chapter of AIS, October 2009 (itAIS 2009)}, pages = {165-173}, address = {Costa Smeralda, Italy}, abstract = {In this paper we present the research activity we are carrying out in the \"\"Mobile Semantic Self-Organizing Wireless Sensor Networks\"\" Project at the Department of Information Engineering of the University of Modena and Reggio Emilia. In this context, the main aim of our research is to study solutions for the flexible querying of distributed data collected by heterogeneous devices providing measurement readings. To this end, we propose a middleware for wireless sensor networks which is able to autonomously configure the communication and the operations required to each device in order to reduce energy and temporal costs. }, keywords = {Data Stream, Radio Sensors Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we present the research activity we are carrying out in the ""Mobile Semantic Self-Organizing Wireless Sensor Networks"" Project at the Department of Information Engineering of the University of Modena and Reggio Emilia. In this context, the main aim of our research is to study solutions for the flexible querying of distributed data collected by heterogeneous devices providing measurement readings. To this end, we propose a middleware for wireless sensor networks which is able to autonomously configure the communication and the operations required to each device in order to reduce energy and temporal costs. |
69. | S. Bergamaschi, F. Guerra, F. Mandreoli, M. Vincini (2009): Working in a Dynamic Environment: the NeP4B Approach as a MAS. Proceedings of the 8th International Workshop on Agent and Peer to Peer Computing, May 2009 (AP2PC 2009), pp. 1-13, Budapest, Hungary, 2009. (Type: Inproceeding | Abstract | BibTeX | Tags: Data Sharing, NeP4B Project) @inproceedings{pub79, title = {Working in a Dynamic Environment: the NeP4B Approach as a MAS}, author = {S. Bergamaschi and F. Guerra and F. Mandreoli and M. Vincini}, year = {2009}, date = {2009-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 8th International Workshop on Agent and Peer to Peer Computing, May 2009 (AP2PC 2009)}, pages = {1-13}, address = {Budapest, Hungary}, abstract = {Integration of heterogeneous information in the context of Internet is becoming a key activity to enable a more organized and semantically meaningful access to several kinds of information in the form of data sources, multimedia documents and web services. In NeP4B (Networked Peers for Business), a project funded by the Italian Ministry of University and Research, we developed an approach for providing a uniform representation of data, multimedia and services, thus allowing users to obtain sets of data, multimedia documents and lists of webservices as query results. NeP4B is based on a P2P network of semantic peers, connected one with each other by means of automatically generated mappings. In this paper we present a new architecture for NeP4B, based on a Multi-Agent System.We claim that such a solution may be more efficient and effective, thanks to the agents\' autonomy and intelligence, in a dynamic environment, where sources are frequently added (or deleted) to (from) the network.}, keywords = {Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } Integration of heterogeneous information in the context of Internet is becoming a key activity to enable a more organized and semantically meaningful access to several kinds of information in the form of data sources, multimedia documents and web services. In NeP4B (Networked Peers for Business), a project funded by the Italian Ministry of University and Research, we developed an approach for providing a uniform representation of data, multimedia and services, thus allowing users to obtain sets of data, multimedia documents and lists of webservices as query results. NeP4B is based on a P2P network of semantic peers, connected one with each other by means of automatically generated mappings. In this paper we present a new architecture for NeP4B, based on a Multi-Agent System.We claim that such a solution may be more efficient and effective, thanks to the agents' autonomy and intelligence, in a dynamic environment, where sources are frequently added (or deleted) to (from) the network. |
2008 | |
68. | F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli, G. Villani (2008): Paving the Way to an Effective and Efficient Retrieval of Data over Semantic Overlay Networks. The Semantic Web for Knowledge and Data Management: Technologies and Practices, Zhongmin Ma (Ed.), IGI Global, pp. 151-175, 2008. (Type: Incollection | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @incollection{pub61, title = {Paving the Way to an Effective and Efficient Retrieval of Data over Semantic Overlay Networks}, author = {F. Mandreoli and R. Martoglia and W. Penzo and S. Sassatelli and G. Villani}, url = {http://www.isgroup.unimore.it/article/mmpp08.pdf}, year = {2008}, date = {2008-01-01}, urldate = {2013-06-12}, booktitle = {The Semantic Web for Knowledge and Data Management: Technologies and Practices, Zhongmin Ma (Ed.), IGI Global}, pages = {151-175}, abstract = {In a Peer-to-Peer (P2P) system, a Semantic Overlay Network (SON) models a network of peers whose connections are influenced by the peers\' content, so that semantically related peers connect each other. This is very common in P2P communities, where peers share common interests, and a peer can belong to more than one SON, depending on its own interests. Querying such a kind of systems is not an easy task: The retrieval of relevant data can not rely on flooding approaches which forward a query to the overall network. A way of selecting which peers are more likely to provide relevant answers is necessary to support more efficient and effective query processing strategies. This chapter presents a semantic infrastructure for routing queries effectively in a network of SONs. Peers are semantically rich, in that peers\' content is modelled with a schema on their local data, and peers are related each other through semantic mappings defined between their own schemas. A query is routed through the network by means of a sequence of reformulations, according to the semantic mappings encountered in the routing path. As reformulations may lead to semantic approximations, we define a fully distributed indexing mechanism which summarizes the semantics underlying whole subnetworks, in order to be able to locate the semantically best directions to forward a query to. In support of our proposal, we demonstrate through a rich set of experiments that our routing mechanism overtakes algorithms which are usually limited to the only knowledge of the peers directly connected to the querying peer, and that our approach is particularly successful in a SONs scenario.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {incollection} } In a Peer-to-Peer (P2P) system, a Semantic Overlay Network (SON) models a network of peers whose connections are influenced by the peers' content, so that semantically related peers connect each other. This is very common in P2P communities, where peers share common interests, and a peer can belong to more than one SON, depending on its own interests. Querying such a kind of systems is not an easy task: The retrieval of relevant data can not rely on flooding approaches which forward a query to the overall network. A way of selecting which peers are more likely to provide relevant answers is necessary to support more efficient and effective query processing strategies. This chapter presents a semantic infrastructure for routing queries effectively in a network of SONs. Peers are semantically rich, in that peers' content is modelled with a schema on their local data, and peers are related each other through semantic mappings defined between their own schemas. A query is routed through the network by means of a sequence of reformulations, according to the semantic mappings encountered in the routing path. As reformulations may lead to semantic approximations, we define a fully distributed indexing mechanism which summarizes the semantics underlying whole subnetworks, in order to be able to locate the semantically best directions to forward a query to. In support of our proposal, we demonstrate through a rich set of experiments that our routing mechanism overtakes algorithms which are usually limited to the only knowledge of the peers directly connected to the querying peer, and that our approach is particularly successful in a SONs scenario. |
67. | F. Mandreoli, W. Penzo, S. Sassatelli, S. Lodi, R. Martoglia (2008): Semantic Peer, Here are the Neighbors You Want!. Proceedings of the 11th International Conference on Extending Database Technology, March 2008 (EDBT 2008), pp. 26-37, Nantes, France, 2008. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Sharing, NeP4B Project) @inproceedings{pub63, title = {Semantic Peer, Here are the Neighbors You Want!}, author = {F. Mandreoli and W. Penzo and S. Sassatelli and S. Lodi and R. Martoglia}, url = {http://www.isgroup.unimore.it/article/edbt08.pdf}, year = {2008}, date = {2008-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 11th International Conference on Extending Database Technology, March 2008 (EDBT 2008)}, pages = {26-37}, address = {Nantes, France}, abstract = {Peer Data Management Systems (PDMSs) have been introduced as a solution to the problem of large-scale sharing of semantically rich data. A PDMS consists of semantic peers connected through semantic mappings. Querying a PDMS may lead to very poor results, because of the semantic degradation due to the approximations given by the traversal of the semantic mappings, thus leading to the problem of how to boost a network of mappings in a PDMS. In this paper we propose a strategy for the incremental maintenance of a flexible network organization that clusters together peers which are semantically related in Semantic Overlay Networks (SONs), while maintaining a high degree of node autonomy. Semantic features, a summarized representation of clusters, are stored in a \"\"light\"\" structure which effectively assists a newly entering peer when choosing its semantically closest overlay networks. Then, each peer is supported in the selection of its own neighbors within each overlay network according to two policies: Range-based selection and k-NN selection. For both policies, we introduce specific algorithms which exploit a distributed indexing mechanism for efficient network navigation. The proposed approach has been implemented in a prototype where its effectiveness and efficiency have been extensively tested.}, keywords = {Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } Peer Data Management Systems (PDMSs) have been introduced as a solution to the problem of large-scale sharing of semantically rich data. A PDMS consists of semantic peers connected through semantic mappings. Querying a PDMS may lead to very poor results, because of the semantic degradation due to the approximations given by the traversal of the semantic mappings, thus leading to the problem of how to boost a network of mappings in a PDMS. In this paper we propose a strategy for the incremental maintenance of a flexible network organization that clusters together peers which are semantically related in Semantic Overlay Networks (SONs), while maintaining a high degree of node autonomy. Semantic features, a summarized representation of clusters, are stored in a ""light"" structure which effectively assists a newly entering peer when choosing its semantically closest overlay networks. Then, each peer is supported in the selection of its own neighbors within each overlay network according to two policies: Range-based selection and k-NN selection. For both policies, we introduce specific algorithms which exploit a distributed indexing mechanism for efficient network navigation. The proposed approach has been implemented in a prototype where its effectiveness and efficiency have been extensively tested. |
66. | F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli, G. Villani (2008): Building a PDMS Infrastructure for XML Data Sharing with SUNRISE. Proceedings of the 3rd International EDBT Workshop on Database Technologies for Handling XML Information on the Web, March 2008 (DATAX 2008), Nantes, France, 2008. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub64, title = {Building a PDMS Infrastructure for XML Data Sharing with SUNRISE}, author = {F. Mandreoli and R. Martoglia and W. Penzo and S. Sassatelli and G. Villani}, url = {http://www.isgroup.unimore.it/article/datax08.pdf}, year = {2008}, date = {2008-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 3rd International EDBT Workshop on Database Technologies for Handling XML Information on the Web, March 2008 (DATAX 2008)}, address = {Nantes, France}, abstract = {Semantic support for data representation as well as a flexible machine-readable format have made XML the de facto standard for Internet applications semantic interoperability. Its applicability is primarily evident in realities where actors are heterogeneous data sources which interact each other for data sharing purposes. This is exactly the scenario envisioned by Peer Data Management Systems (PDMSs), where autonomous sources (peers) model their local data according to a schema, and are connected in a peer-to-peer network by means of pairwise semantic mappings between the peers\' own schemas. One of the main challenges in such a semantically heterogeneous environment is concerned with query processing when dealing with the inherent semantic approximations occurring in the data. In this paper we present an instantiation of SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration) for XML data sources. SUNRISE is a complete PDMS infrastructure which extends each peer with functionalities for capturing the semantic approximation originating from schema heterogeneity and exploiting it for a semantically driven network organization and query routing.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } Semantic support for data representation as well as a flexible machine-readable format have made XML the de facto standard for Internet applications semantic interoperability. Its applicability is primarily evident in realities where actors are heterogeneous data sources which interact each other for data sharing purposes. This is exactly the scenario envisioned by Peer Data Management Systems (PDMSs), where autonomous sources (peers) model their local data according to a schema, and are connected in a peer-to-peer network by means of pairwise semantic mappings between the peers' own schemas. One of the main challenges in such a semantically heterogeneous environment is concerned with query processing when dealing with the inherent semantic approximations occurring in the data. In this paper we present an instantiation of SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration) for XML data sources. SUNRISE is a complete PDMS infrastructure which extends each peer with functionalities for capturing the semantic approximation originating from schema heterogeneity and exploiting it for a semantically driven network organization and query routing. |
65. | F. Grandi, F. Mandreoli, R. Martoglia, E. Ronchetti, M. R. Scalas, P. Tiberio (2008): Ontology-based Personalization of e-Government Services. Intelligent User Interfaces: Adaptation and Personalization Systems and Technologies, Constantinos Mourlas and Panagiotis Germanakos (Ed.), IGI Global, pp. 167-187, 2008. (Type: Incollection | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @incollection{pub65, title = {Ontology-based Personalization of e-Government Services}, author = {F. Grandi and F. Mandreoli and R. Martoglia and E. Ronchetti and M. R. Scalas and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/gmmp08.pdf}, year = {2008}, date = {2008-01-01}, urldate = {2013-06-12}, booktitle = {Intelligent User Interfaces: Adaptation and Personalization Systems and Technologies, Constantinos Mourlas and Panagiotis Germanakos (Ed.), IGI Global}, pages = {167-187}, abstract = {While the World Wide Web user is suffering form the disease caused by information overload, for which personalization is one of the treatments which work, the citizen who gets ready to use the e-Government services which are made available on the Web is not immune from contagion. This seems a good reason to try to prescribe a personalization treatment also to the e-Government user. Hence, we introduce the design and implementation of Web information systems supporting personalized access to multi-version resources in an e-Government scenario. Personalization is supported by means of Semantic Web techniques and relies on an ontology-based profiling of users (citizens). Resources we consider are collections of norm documents (laws, decrees, regulations, etc.) in XML format but can also be generic Web pages and portals or e-Government transactional services. We introduce a reference infrastructure, describe the organization and present performance figures of a prototype system we have developed.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {incollection} } While the World Wide Web user is suffering form the disease caused by information overload, for which personalization is one of the treatments which work, the citizen who gets ready to use the e-Government services which are made available on the Web is not immune from contagion. This seems a good reason to try to prescribe a personalization treatment also to the e-Government user. Hence, we introduce the design and implementation of Web information systems supporting personalized access to multi-version resources in an e-Government scenario. Personalization is supported by means of Semantic Web techniques and relies on an ontology-based profiling of users (citizens). Resources we consider are collections of norm documents (laws, decrees, regulations, etc.) in XML format but can also be generic Web pages and portals or e-Government transactional services. We introduce a reference infrastructure, describe the organization and present performance figures of a prototype system we have developed. |
64. | F. Mandreoli, W. Penzo, S. Sassatelli, S. Lodi, R. Martoglia (2008): Boosting a Network of Semantic Peers. Proceedings of the 16th Italian Symposium on Advanced Database Technologies, June 2008 (SEBD 2008), pp. 318-325, Mondello (PA), Italy, 2008. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Sharing, NeP4B Project) @inproceedings{pub66, title = {Boosting a Network of Semantic Peers}, author = {F. Mandreoli and W. Penzo and S. Sassatelli and S. Lodi and R. Martoglia}, url = {http://www.isgroup.unimore.it/article/sebd08.pdf}, year = {2008}, date = {2008-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 16th Italian Symposium on Advanced Database Technologies, June 2008 (SEBD 2008)}, pages = {318-325}, address = {Mondello (PA), Italy}, abstract = {In a Peer Data Management System (PDMS), semantic peers connect with each other through semantic mappings between their own schemas. Because of schema heterogeneity, due to peers\' autonomy as for data representation, querying a PDMS implies query reformulations across semantic mappings, possibly incurring in a semantic degradation due to the reiterated approximations given by the traversal of long paths. The linkage closeness of semantically similar peers is thus a crucial issue. In this paper we present a strategy for the incremental maintenance of a flexible network organization for PDMSs that clusters together semantically related peers.}, keywords = {Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } In a Peer Data Management System (PDMS), semantic peers connect with each other through semantic mappings between their own schemas. Because of schema heterogeneity, due to peers' autonomy as for data representation, querying a PDMS implies query reformulations across semantic mappings, possibly incurring in a semantic degradation due to the reiterated approximations given by the traversal of long paths. The linkage closeness of semantically similar peers is thus a crucial issue. In this paper we present a strategy for the incremental maintenance of a flexible network organization for PDMSs that clusters together semantically related peers. |
63. | F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli, G. Villani (2008): Efficient and Effective Query Answering in a PDMS with SUNRISE. Proceedings of the 16th Italian Symposium on Advanced Database Technologies, June 2008 (SEBD 2008), pp. 446-451, Mondello (PA), Italy, 2008. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub67, title = {Efficient and Effective Query Answering in a PDMS with SUNRISE}, author = {F. Mandreoli and R. Martoglia and W. Penzo and S. Sassatelli and G. Villani}, url = {http://www.isgroup.unimore.it/article/sebd08_demo.pdf}, year = {2008}, date = {2008-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 16th Italian Symposium on Advanced Database Technologies, June 2008 (SEBD 2008)}, pages = {446-451}, address = {Mondello (PA), Italy}, abstract = {Peer Data Management Systems (PDMSs) have been recently proposed as an evolution of Peer-To-Peer (P2P) systems toward a more semantics-based description of peers\' contents and relationships. In a PDMS scenario a key challenge is query routing, i.e. the capability of selecting small subsets of semantically relevant peers to forward a query to. In this paper we demonstrate SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration), a complete infrastructure which supports an effective and efficient exploration of a PDMS network for query answering purposes. SUNRISE offers several routing policies designed around different performance priorities in order to minimize the information spanning over the network.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } Peer Data Management Systems (PDMSs) have been recently proposed as an evolution of Peer-To-Peer (P2P) systems toward a more semantics-based description of peers' contents and relationships. In a PDMS scenario a key challenge is query routing, i.e. the capability of selecting small subsets of semantically relevant peers to forward a query to. In this paper we demonstrate SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration), a complete infrastructure which supports an effective and efficient exploration of a PDMS network for query answering purposes. SUNRISE offers several routing policies designed around different performance priorities in order to minimize the information spanning over the network. |
62. | F. Mandreoli, R. Martoglia, S. Sassatelli, P. Tiberio, W. Penzo, C. Gennaro, M. Mordacchini, S. Orlando (2008): Toward an Effective and Efficient Query Processing in the NeP4B Project. Proceedings of the V Conference of the Italian Chapter of AIS, December 2008 (itAIS 2008), Paris, France, 2008. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub70, title = {Toward an Effective and Efficient Query Processing in the NeP4B Project}, author = {F. Mandreoli and R. Martoglia and S. Sassatelli and P. Tiberio and W. Penzo and C. Gennaro and M. Mordacchini and S. Orlando}, url = {http://www.isgroup.unimore.it/article/itais08.pdf}, year = {2008}, date = {2008-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the V Conference of the Italian Chapter of AIS, December 2008 (itAIS 2008)}, address = {Paris, France}, abstract = {In this paper we present our main current research activity in the Italian co-funded FIRB Project NeP4B (Networked Peers for Business). In particular, we provide an overview of our P2P query routing approach which combines semantics and multimedia aspects in order to make query processing effective and efficient.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we present our main current research activity in the Italian co-funded FIRB Project NeP4B (Networked Peers for Business). In particular, we provide an overview of our P2P query routing approach which combines semantics and multimedia aspects in order to make query processing effective and efficient. |
61. | F. Grandi, F. Mandreoli, R. Martoglia (2008): Issues in Personalized Access to Multi-Version XML Documents. Open and Novel Issues in XML Database Applications: Future Directions and Advanced Technologies, Eric Pardede (Ed.), IGI Global, pp. 199-230, 2008. (Type: Incollection | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @incollection{pub71, title = { Issues in Personalized Access to Multi-Version XML Documents}, author = {F. Grandi and F. Mandreoli and R. Martoglia}, url = {http://www.isgroup.unimore.it/article/gmm08.pdf}, year = {2008}, date = {2008-01-01}, urldate = {2013-06-12}, booktitle = {Open and Novel Issues in XML Database Applications: Future Directions and Advanced Technologies, Eric Pardede (Ed.), IGI Global}, pages = {199-230}, abstract = {In several application fields including legal and medical domains, XML documents are \"\"versioned\"\" along different dimensions of interest, whose nature depends on the application needs such as time, space and security. Specifically, temporal and semantic versioning is particularly demanding in a broad range of application domains where temporal versioning can be used to maintain histories of the underlying resources along various time dimensions, and semantic versioning can then be used to model limited applicability of resources to individual cases or contexts. The selection and reconstruction of the version(s) of interest for a user means the retrieval of those fragments of documents that match both the implicit and explicit user needs, which can be formalized as what we call personalization queries. In this chapter, we focus on the design and implementation issues of a personalization query processor. We consider different design options and, among them, we introduce an in-depth study of a native solution by showing, also through experimental evaluation, how some of the best performing technological solutions available today for XML data management can be successfully extended and optimally combined in order to support personalization queries.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {incollection} } In several application fields including legal and medical domains, XML documents are ""versioned"" along different dimensions of interest, whose nature depends on the application needs such as time, space and security. Specifically, temporal and semantic versioning is particularly demanding in a broad range of application domains where temporal versioning can be used to maintain histories of the underlying resources along various time dimensions, and semantic versioning can then be used to model limited applicability of resources to individual cases or contexts. The selection and reconstruction of the version(s) of interest for a user means the retrieval of those fragments of documents that match both the implicit and explicit user needs, which can be formalized as what we call personalization queries. In this chapter, we focus on the design and implementation issues of a personalization query processor. We consider different design options and, among them, we introduce an in-depth study of a native solution by showing, also through experimental evaluation, how some of the best performing technological solutions available today for XML data management can be successfully extended and optimally combined in order to support personalization queries. |
2007 | |
60. | F. Mandreoli, R. Martoglia, S. Sassatelli, W. Penzo (2007): Semantic Routing for Effective Search in Heterogeneous and Distributed Digital Libraries. Proceedings of the 3rd Italian Research Conference on Digital Library Management Systems, January 2007 (IRCDL 2007), pp. 67-70, Padova, Italy, 2007. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub54, title = {Semantic Routing for Effective Search in Heterogeneous and Distributed Digital Libraries}, author = {F. Mandreoli and R. Martoglia and S. Sassatelli and W. Penzo}, url = {http://www.isgroup.unimore.it/article/ircdl07.pdf}, year = {2007}, date = {2007-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 3rd Italian Research Conference on Digital Library Management Systems, January 2007 (IRCDL 2007)}, pages = {67-70}, address = {Padova, Italy}, abstract = {Next generation Digital Libraries (DLs) will offer an entire ensemble of systems and services designed to help users to easily find and access the information they are looking for. However, much work is still required in order to achieve this vision. In this paper, we concentrate our attention on devising techniques allowing an effective routing of queries, which we think can be of the utmost importance in providing effective and efficient querying in heterogeneous and distributed DLs, identifying the best ways to navigate the available nodes and, thus, the documents (or their parts) which are most suitable to best answer the user needs. We describe a routing mechanism, which we call routing by mapping, in which the query is sent to the DL peers whose subnetworks best approximate the concepts required. To this end a distributed index mechanism is adopted, which we call Semantic Routing Index (SRI). We also present some exploratory experiments showing the effectiveness of the proposed approach.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } Next generation Digital Libraries (DLs) will offer an entire ensemble of systems and services designed to help users to easily find and access the information they are looking for. However, much work is still required in order to achieve this vision. In this paper, we concentrate our attention on devising techniques allowing an effective routing of queries, which we think can be of the utmost importance in providing effective and efficient querying in heterogeneous and distributed DLs, identifying the best ways to navigate the available nodes and, thus, the documents (or their parts) which are most suitable to best answer the user needs. We describe a routing mechanism, which we call routing by mapping, in which the query is sent to the DL peers whose subnetworks best approximate the concepts required. To this end a distributed index mechanism is adopted, which we call Semantic Routing Index (SRI). We also present some exploratory experiments showing the effectiveness of the proposed approach. |
59. | F. Mandreoli, R. Martoglia, E. Ronchetti (2007): Native Temporal Slicing Support for XML Databases. Proceedings of the International Conference on Internet Computing, June 2007 (ICOMP - XmlTech 2007), pp. 287-293, Las Vegas, Nevada (USA), 2007. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, norma nel tempo project) @inproceedings{pub55, title = {Native Temporal Slicing Support for XML Databases}, author = {F. Mandreoli and R. Martoglia and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/xmltech07.pdf}, year = {2007}, date = {2007-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the International Conference on Internet Computing, June 2007 (ICOMP - XmlTech 2007)}, pages = {287-293}, address = {Las Vegas, Nevada (USA)}, abstract = {XML databases, providing structural querying support, are becoming more and more popular. As we know, XML data may change over time and providing an efficient support to queries which also involve temporal aspects is still an open issue. In this paper we present our native Temporal XML Query Processor, which exploits an ad-hoc temporal indexing scheme relying on relational approaches and a technology supporting temporal slicing. As we show through an extensive experimental evaluation, our solution achieves good efficiency results, outperforming stratum-based solutions when dealing with time-related application requirements while continuing to guarantee good performance in traditional scenarios.}, keywords = {Data Versioning, norma nel tempo project}, pubstate = {published}, tppubtype = {inproceedings} } XML databases, providing structural querying support, are becoming more and more popular. As we know, XML data may change over time and providing an efficient support to queries which also involve temporal aspects is still an open issue. In this paper we present our native Temporal XML Query Processor, which exploits an ad-hoc temporal indexing scheme relying on relational approaches and a technology supporting temporal slicing. As we show through an extensive experimental evaluation, our solution achieves good efficiency results, outperforming stratum-based solutions when dealing with time-related application requirements while continuing to guarantee good performance in traditional scenarios. |
58. | F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli, G. Villani (2007): SUNRISE: Exploring PDMS Networks with Semantic Routing Indexes.. Proceedings of the 4th European Sematic Web Conference, June 2007 (ESWC 2007), Innsbruck, Austria., 2007. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub56, title = {SUNRISE: Exploring PDMS Networks with Semantic Routing Indexes.}, author = {F. Mandreoli and R. Martoglia and W. Penzo and S. Sassatelli and G. Villani}, url = {http://www.isgroup.unimore.it/article/eswc07.pdf}, year = {2007}, date = {2007-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 4th European Sematic Web Conference, June 2007 (ESWC 2007)}, address = {Innsbruck, Austria.}, abstract = {We demonstrate SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration), a complete infrastructure supporting the construction of a PDMS semantic layer and providing a series of techniques that can be used for an effective and efficient exploration of a semantic network, for instance in a query answering setting.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } We demonstrate SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration), a complete infrastructure supporting the construction of a PDMS semantic layer and providing a series of techniques that can be used for an effective and efficient exploration of a semantic network, for instance in a query answering setting. |
57. | F. Mandreoli, R. Martoglia, E. Ronchetti (2007): Disambiguation of Structure-Based Information in the STRIDER System. Proceedings of the 15th Italian Symposium on Advanced Database Technologies, June 2007 (SEBD 2007), pp. 499-502, Torre Canne (Fasano, BR), Italy, 2007. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: NeP4B Project, Structural Disambiguation) @inproceedings{pub57, title = {Disambiguation of Structure-Based Information in the STRIDER System}, author = {F. Mandreoli and R. Martoglia and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/sebd07demo.pdf}, year = {2007}, date = {2007-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 15th Italian Symposium on Advanced Database Technologies, June 2007 (SEBD 2007)}, pages = {499-502}, address = {Torre Canne (Fasano, BR), Italy}, abstract = {We present the current version of STRIDER, a versatile system for the disambiguation of structure-based information like XML schemas, structures of XML documents and web directories. It can be of support to the semantic-awareness of a wide range of applications, thanks to its novel and fully-automated disambiguation algorithms.}, keywords = {NeP4B Project, Structural Disambiguation}, pubstate = {published}, tppubtype = {inproceedings} } We present the current version of STRIDER, a versatile system for the disambiguation of structure-based information like XML schemas, structures of XML documents and web directories. It can be of support to the semantic-awareness of a wide range of applications, thanks to its novel and fully-automated disambiguation algorithms. |
56. | F. Mandreoli, W. Penzo, A. M. Perdichizzi (2007): Semantic Web Service Composition in the NeP4B Project: Challenges and Architectural Issues. Proceedings of the 15th Italian Symposium on Advanced Database Technologies, June 2007 (SEBD 2007), pp. 414-421, Torre Canne (Fasano, BR), Italy, 2007. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Sharing, NeP4B Project) @inproceedings{pub58, title = {Semantic Web Service Composition in the NeP4B Project: Challenges and Architectural Issues}, author = {F. Mandreoli and W. Penzo and A. M. Perdichizzi}, url = {http://www.isgroup.unimore.it/article/sebd07.pdf}, year = {2007}, date = {2007-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 15th Italian Symposium on Advanced Database Technologies, June 2007 (SEBD 2007)}, pages = {414-421}, address = {Torre Canne (Fasano, BR), Italy}, abstract = {Semantic Web service discovery and composition frameworks proposed so far assume for the most part a centralized registry that holds information of all the Web services available at any given time. This solution does not well cope with the scalability and flexibility requirements of dynamic, fast changing contexts. As part of the NeP4B project, in this paper we propose an alternative peer to peer architecture based on the Goal concept.}, keywords = {Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } Semantic Web service discovery and composition frameworks proposed so far assume for the most part a centralized registry that holds information of all the Web services available at any given time. This solution does not well cope with the scalability and flexibility requirements of dynamic, fast changing contexts. As part of the NeP4B project, in this paper we propose an alternative peer to peer architecture based on the Goal concept. |
55. | F. Mandreoli, W. Penzo, A. M. Perdichizzi (2007): A P2P-based Architecture for Semantic Web Service Automatic Composition. Proceedings of the 1st DEXA International Workshop on Semantic Web Architectures for Enterprises, September 2007 (SWAE 2007), pp. 429-433, Regensburg, Germany, 2007. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Sharing, NeP4B Project) @inproceedings{pub59, title = {A P2P-based Architecture for Semantic Web Service Automatic Composition}, author = {F. Mandreoli and W. Penzo and A. M. Perdichizzi}, url = {http://www.isgroup.unimore.it/article/dexa07.pdf}, year = {2007}, date = {2007-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 1st DEXA International Workshop on Semantic Web Architectures for Enterprises, September 2007 (SWAE 2007)}, pages = {429-433}, address = {Regensburg, Germany}, abstract = {The ultimate vision for eBusiness is an Internet-based global market place, accessible to all enterprises, regardless of size and geographical location, where automatic cooperation and integration among firms are allowed and enhanced. A powerful mean for these purposes is represented by sharing, reusing and composing value-added services made available on the Web, i.e. Web services. In this scenario, by making the Web content machine accessible and understandable, semantic Web services aim to provide efficient and effective Web service automatic discovery, selection, and composition. As part of the NeP4B project, in this paper we propose an architecture to address these issues in a flexible and scalable P2P network.}, keywords = {Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } The ultimate vision for eBusiness is an Internet-based global market place, accessible to all enterprises, regardless of size and geographical location, where automatic cooperation and integration among firms are allowed and enhanced. A powerful mean for these purposes is represented by sharing, reusing and composing value-added services made available on the Web, i.e. Web services. In this scenario, by making the Web content machine accessible and understandable, semantic Web services aim to provide efficient and effective Web service automatic discovery, selection, and composition. As part of the NeP4B project, in this paper we propose an architecture to address these issues in a flexible and scalable P2P network. |
54. | F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli, G. Villani (2007): SRI@work: Efficient and Effective Routing Strategies in a PDMS. Proceedings of the 8th International Conference on Web Information Systems Engineering, December 2007 (WISE 2007), pp. 285-297, Nancy, France, 2007. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub60, title = {SRI@work: Efficient and Effective Routing Strategies in a PDMS}, author = {F. Mandreoli and R. Martoglia and W. Penzo and S. Sassatelli and G. Villani}, url = {http://www.isgroup.unimore.it/article/wise07.pdf}, year = {2007}, date = {2007-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 8th International Conference on Web Information Systems Engineering, December 2007 (WISE 2007)}, pages = {285-297}, address = {Nancy, France}, abstract = {In recent years, information sharing has gained much benefit by the large diffusion of distributed computing, namely through P2P systems and, in line with the Semantic Web vision, through Peer Data Management Systems (PDMSs). In a PDMS scenario one of the most difficult challenges is query routing, i.e. the capability of selecting small subsets of semantically relevant peers to forward a query to. In this paper, we put the Semantic Routing Index (SRI) distributed mechanism we proposed in [6] at work. In particular, we present general SRI-based query execution models, designed around different performance priorities and minimizing the information spanning over the network. Starting from these models, we devise several SRI-enabled routing policies, characterized by different effectiveness and efficiency targets, and we deeply test them in ad-hoc PDMS simulation environments.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } In recent years, information sharing has gained much benefit by the large diffusion of distributed computing, namely through P2P systems and, in line with the Semantic Web vision, through Peer Data Management Systems (PDMSs). In a PDMS scenario one of the most difficult challenges is query routing, i.e. the capability of selecting small subsets of semantically relevant peers to forward a query to. In this paper, we put the Semantic Routing Index (SRI) distributed mechanism we proposed in [6] at work. In particular, we present general SRI-based query execution models, designed around different performance priorities and minimizing the information spanning over the network. Starting from these models, we devise several SRI-enabled routing policies, characterized by different effectiveness and efficiency targets, and we deeply test them in ad-hoc PDMS simulation environments. |
2006 | |
53. | F. Mandreoli, R. Martoglia, E. Ronchetti (2006): Supporting temporal slicing in XML databases. Proceedings of the 10th International Conference on Extending Database Technology, March 2006 (EDBT 2006), pp. 295-312, Munich, Germany, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub45, title = {Supporting temporal slicing in XML databases}, author = {F. Mandreoli and R. Martoglia and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/temporalSlicingEDBT06.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 10th International Conference on Extending Database Technology, March 2006 (EDBT 2006)}, pages = {295-312}, address = {Munich, Germany}, abstract = {Nowadays XML is universally accepted as the standard for structural data representation; XML databases, providing structural querying support, are thus becoming more and more popular. However, XML data changes over time and the task of providing efficient support to queries which also involve temporal aspects goes through the tricky task of time-slicing the input data. In this paper we take up the challenge of providing a native and efficient solution in constructing an XML query processor supporting temporal slicing, thus dealing with non-conventional application requirements while continuing to guarantee good performance in traditional scenarios. Our contributions include a novel temporal indexing scheme relying on relational approaches and a technology supporting the time-slice operator.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } Nowadays XML is universally accepted as the standard for structural data representation; XML databases, providing structural querying support, are thus becoming more and more popular. However, XML data changes over time and the task of providing efficient support to queries which also involve temporal aspects goes through the tricky task of time-slicing the input data. In this paper we take up the challenge of providing a native and efficient solution in constructing an XML query processor supporting temporal slicing, thus dealing with non-conventional application requirements while continuing to guarantee good performance in traditional scenarios. Our contributions include a novel temporal indexing scheme relying on relational approaches and a technology supporting the time-slice operator. |
52. | F. Mandreoli, R. Martoglia, E. Ronchetti (2006): STRIDER: a Versatile System for Structural Disambiguation. Proceedings of the 10th International Conference on Extending Database Technology, March 2006 (EDBT 2006), pp. 1194-1197, Munich, Germany, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Structural Disambiguation, Wisdom Project) @inproceedings{pub46, title = {STRIDER: a Versatile System for Structural Disambiguation}, author = {F. Mandreoli and R. Martoglia and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/striderDemoEDBT06.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 10th International Conference on Extending Database Technology, March 2006 (EDBT 2006)}, pages = {1194-1197}, address = {Munich, Germany}, abstract = {We present STRIDER, a versatile system for the disambiguation of structure-based information like XML schemas, structures of XML documents and web directories. The system performs high-quality fully-automated disambiguation by exploiting a novel and versatile structural disambiguation approach.}, keywords = {Structural Disambiguation, Wisdom Project}, pubstate = {published}, tppubtype = {inproceedings} } We present STRIDER, a versatile system for the disambiguation of structure-based information like XML schemas, structures of XML documents and web directories. The system performs high-quality fully-automated disambiguation by exploiting a novel and versatile structural disambiguation approach. |
51. | F. Mandreoli, R. Martoglia, P. Tiberio, F. Grandi, M. R. Scalas, E. Ronchetti (2006): An eGovernment system for temporal- and semantic-aware access to norms. Proceedings of the Semantic Web meets eGovernment Symposium, March 2006 (SWEG 2006), Stanfornd, USA, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub48, title = {An eGovernment system for temporal- and semantic-aware access to norms}, author = {F. Mandreoli and R. Martoglia and P. Tiberio and F. Grandi and M. R. Scalas and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/sweg06.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the Semantic Web meets eGovernment Symposium, March 2006 (SWEG 2006)}, address = {Stanfornd, USA}, abstract = {In this paper, we present the results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a semantic-aware system supporting efficient and personalized access to a multi-version repository of normative texts. The research activity is entitled \"\"Semantic web techniques for the management of digital identity and the access to norms\"\". In the context of a complete and modular infrastructure, we defined a multi-version XML data model and developed a temporal and semantical XML query processor supporting both temporal versioning - essential in normative systems - and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. The whole infrastructure, which we plan to complete in the near future, will integrate the querying component with several auxiliary services, including automatic citizen identification and classification and assisted update of the repository data.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we present the results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a semantic-aware system supporting efficient and personalized access to a multi-version repository of normative texts. The research activity is entitled ""Semantic web techniques for the management of digital identity and the access to norms"". In the context of a complete and modular infrastructure, we defined a multi-version XML data model and developed a temporal and semantical XML query processor supporting both temporal versioning - essential in normative systems - and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. The whole infrastructure, which we plan to complete in the near future, will integrate the querying component with several auxiliary services, including automatic citizen identification and classification and assisted update of the repository data. |
50. | F. Mandreoli, R. Martoglia, S. Sassatelli, P. Tiberio, W. Penzo (2006): Using Semantic Mappings for Query Routing in a PDMS Environment. Proceedings of the 14th Italian Symposium on Advanced Database Technologies, June 2006 (SEBD 2006), pp. 56-63, Portonovo (AN), Italy, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub49, title = {Using Semantic Mappings for Query Routing in a PDMS Environment}, author = {F. Mandreoli and R. Martoglia and S. Sassatelli and P. Tiberio and W. Penzo}, url = {http://www.isgroup.unimore.it/article/sebd06.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 14th Italian Symposium on Advanced Database Technologies, June 2006 (SEBD 2006)}, pages = {56-63}, address = {Portonovo (AN), Italy}, abstract = {In this paper we present the current achievement of our research activity in the WISDOM project, whose aim is the definition of intelligent techniques enabling effective and efficient information search in a distributed and decentralized PDMS scenario. We focus on the query routing problem and we define a new routing mechanism, which we call routing by mapping, in which the query is sent to the peers whose subnetworks best approximate the concepts required. In order to select the best subnetworks, the peer receiving the query exploits information about the semantic approximation of the query concepts, when moving towards each neighbour. This information is computed starting from the semantic mappings established with the peer\'s neighbours and it is maintained into specifically devised data structures called Semantic Routing Indices (SRIs), whose update we propose specific algorithms and protocols for. The effectiveness of the achieved results has been experimentally proved through a series of exploratory tests.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we present the current achievement of our research activity in the WISDOM project, whose aim is the definition of intelligent techniques enabling effective and efficient information search in a distributed and decentralized PDMS scenario. We focus on the query routing problem and we define a new routing mechanism, which we call routing by mapping, in which the query is sent to the peers whose subnetworks best approximate the concepts required. In order to select the best subnetworks, the peer receiving the query exploits information about the semantic approximation of the query concepts, when moving towards each neighbour. This information is computed starting from the semantic mappings established with the peer's neighbours and it is maintained into specifically devised data structures called Semantic Routing Indices (SRIs), whose update we propose specific algorithms and protocols for. The effectiveness of the achieved results has been experimentally proved through a series of exploratory tests. |
49. | F. Mandreoli, R. Martoglia, P. Tiberio, F. Grandi, M. R. Scalas, E. Ronchetti (2006): Semantic Web Techniques for Personalization of eGovernment Services. Proceedings of the 1st International Workshop on Semantic Web Applications: Theory and Practice, November 2006 (ER SemWAT 2006), pp. 435-444, Tucson, USA, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub50, title = {Semantic Web Techniques for Personalization of eGovernment Services}, author = {F. Mandreoli and R. Martoglia and P. Tiberio and F. Grandi and M. R. Scalas and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/semwat06.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 1st International Workshop on Semantic Web Applications: Theory and Practice, November 2006 (ER SemWAT 2006)}, pages = {435-444}, address = {Tucson, USA}, abstract = {In this paper, we present the results of an ongoing research involving the design and implementation of systems supporting personalized access to multi-version resources in an eGovernment scenario. Personalization is supported by means of Semantic Web techniques and is based on an ontology-based profiling of users (citizens). Resources we consider are collections of norm documents in XML format but can also be generic Web pages and portals or eGovernment services. We introduce a reference infrastructure, describe the organization and present performance figures of a prototype system we have developed.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we present the results of an ongoing research involving the design and implementation of systems supporting personalized access to multi-version resources in an eGovernment scenario. Personalization is supported by means of Semantic Web techniques and is based on an ontology-based profiling of users (citizens). Resources we consider are collections of norm documents in XML format but can also be generic Web pages and portals or eGovernment services. We introduce a reference infrastructure, describe the organization and present performance figures of a prototype system we have developed. |
48. | F. Mandreoli, R. Martoglia, P. Tiberio, M. Righini (2006): A Native Extensible XML Query Processor Towards Efficient and Effective MPEG-7 Querying. Proceedings of the 2nd Italian Research Conference on Digital Library Management Systems, January 2006 (IRCDL 2006), pp. 21-24, Padova, Italy, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: DELOS Project, Twig Query Processing) @inproceedings{pub51, title = {A Native Extensible XML Query Processor Towards Efficient and Effective MPEG-7 Querying}, author = {F. Mandreoli and R. Martoglia and P. Tiberio and M. Righini}, url = {http://www.isgroup.unimore.it/article/ircdl06.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 2nd Italian Research Conference on Digital Library Management Systems, January 2006 (IRCDL 2006)}, pages = {21-24}, address = {Padova, Italy}, abstract = {In recent years the production of massive amounts of visual information has led to the arrival of very large multimedia Digital Libraries (DLs). The key to support efficient search and management operations in such repositories is to exploit metadata information for digital media, such as MPEG-7 based ones, which seem to be the most widely accepted. The underlying XML syntax, together with the high versatility of the provided constructs, make it easy to specify significant and complex queries, however executing them efficiently on huge quantities of data is not a trivial task. In this paper we provide an overview of the XSiter system, a native and extensible XML query processor providing very high performance in general XML querying settings and whose flexible architecture can be easily enhanced to better support the peculiarities of retrieving multimedia objects through MPEG-7 annotation metadata. Further, we consider possible \"\"use-cases\"\" and tasks related to multimedia and video DLs querying and management which our system can successfully accomplish.}, keywords = {DELOS Project, Twig Query Processing}, pubstate = {published}, tppubtype = {inproceedings} } In recent years the production of massive amounts of visual information has led to the arrival of very large multimedia Digital Libraries (DLs). The key to support efficient search and management operations in such repositories is to exploit metadata information for digital media, such as MPEG-7 based ones, which seem to be the most widely accepted. The underlying XML syntax, together with the high versatility of the provided constructs, make it easy to specify significant and complex queries, however executing them efficiently on huge quantities of data is not a trivial task. In this paper we provide an overview of the XSiter system, a native and extensible XML query processor providing very high performance in general XML querying settings and whose flexible architecture can be easily enhanced to better support the peculiarities of retrieving multimedia objects through MPEG-7 annotation metadata. Further, we consider possible ""use-cases"" and tasks related to multimedia and video DLs querying and management which our system can successfully accomplish. |
47. | F. Mandreoli, R. Martoglia, S. Sassatelli, W. Penzo (2006): SRI: Exploiting Semantic Information for Effective Query Routing in a PDMS. Proceedings of the 8th ACM CIKM International Workshop on Web Information and Data Management, November 2006 (WIDM 2006), pp. 19-26, Arlington, USA, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub52, title = {SRI: Exploiting Semantic Information for Effective Query Routing in a PDMS}, author = {F. Mandreoli and R. Martoglia and S. Sassatelli and W. Penzo}, url = {http://www.isgroup.unimore.it/article/widm06.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 8th ACM CIKM International Workshop on Web Information and Data Management, November 2006 (WIDM 2006)}, pages = {19-26}, address = {Arlington, USA}, abstract = {The huge amount of data available from Internet information sources has focused much attention on the sharing of distributed information through Peer Data Management Systems (PDMSs). In a PDMS, peers have a schema on their local data, and they are related each other through semantic mappings that can be defined between their own schemas. Querying a PDMS means either flooding the network with messages to all peers or take advantage of a routing mechanism to reformulate a query only on the best peers selected according to some given criteria. As reformulations may lead to semantic approximations, we deem that such approximations can be exploited for locating the semantically best directions to forward a query to. In this paper, we propose a distributed index mechanism where each peer is provided with a Semantic Routing Index (SRI) for routing queries effectively. A fuzzy-oriented model for SRI is presented where operations for creating and maintaining SRIs are well-founded. In addition, we show how SRIs can be employed in the query processing phase with the aim of reducing the space of reformulations. Finally, we conduct a series of meaningful experiments showing the effectiveness of the proposed approach.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } The huge amount of data available from Internet information sources has focused much attention on the sharing of distributed information through Peer Data Management Systems (PDMSs). In a PDMS, peers have a schema on their local data, and they are related each other through semantic mappings that can be defined between their own schemas. Querying a PDMS means either flooding the network with messages to all peers or take advantage of a routing mechanism to reformulate a query only on the best peers selected according to some given criteria. As reformulations may lead to semantic approximations, we deem that such approximations can be exploited for locating the semantically best directions to forward a query to. In this paper, we propose a distributed index mechanism where each peer is provided with a Semantic Routing Index (SRI) for routing queries effectively. A fuzzy-oriented model for SRI is presented where operations for creating and maintaining SRIs are well-founded. In addition, we show how SRIs can be employed in the query processing phase with the aim of reducing the space of reformulations. Finally, we conduct a series of meaningful experiments showing the effectiveness of the proposed approach. |
46. | F. Mandreoli, R. Martoglia, S. Sassatelli, W. Penzo (2006): Semantic Query Routing Experiences in a PDMS. Proceedings of the 3rd Italian Semantic Web Workshop, December 2006 (SWAP 2006), Pisa, Italy, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, NeP4B Project) @inproceedings{pub53, title = {Semantic Query Routing Experiences in a PDMS}, author = {F. Mandreoli and R. Martoglia and S. Sassatelli and W. Penzo}, url = {http://www.isgroup.unimore.it/article/swap06.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 3rd Italian Semantic Web Workshop, December 2006 (SWAP 2006)}, address = {Pisa, Italy}, abstract = {Querying a PDMS means either flooding the network with messages to all peers or taking advantage of a routing mechanism to reformulate a query only on the best peers selected according to some given criteria. As reformulations may lead to semantic approximations, we deem that such approximations can be exploited for locating the semantically best directions to forward a query to. In this paper, we present our experiences in devising and testing a mechanism for effective query routing in a PDMS. In particular, we describe a distributed index mechanism where each peer is provided with a Semantic Routing Index (SRI) for routing queries effectively. We illustrate SRIs\' structure, their use and the framework we devised for their incremental update, then we provide an extensive evaluation of their effectiveness through a set of query routing experiments on a variety of scenarios. This work is partially supported by the PRIN WISDOM and FIRB NeP4B national projects.}, keywords = {Approximate search, Data Sharing, NeP4B Project}, pubstate = {published}, tppubtype = {inproceedings} } Querying a PDMS means either flooding the network with messages to all peers or taking advantage of a routing mechanism to reformulate a query only on the best peers selected according to some given criteria. As reformulations may lead to semantic approximations, we deem that such approximations can be exploited for locating the semantically best directions to forward a query to. In this paper, we present our experiences in devising and testing a mechanism for effective query routing in a PDMS. In particular, we describe a distributed index mechanism where each peer is provided with a Semantic Routing Index (SRI) for routing queries effectively. We illustrate SRIs' structure, their use and the framework we devised for their incremental update, then we provide an extensive evaluation of their effectiveness through a set of query routing experiments on a variety of scenarios. This work is partially supported by the PRIN WISDOM and FIRB NeP4B national projects. |
45. | F. Mandreoli, R. Martoglia, P. Tiberio (2006): EXTRA: a system for example-based translation assistance. Machine Translation (MT), 20 (3), pp. 167-197, 2006. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search, EBMT) @article{pub62, title = {EXTRA: a system for example-based translation assistance}, author = {F. Mandreoli and R. Martoglia and P. Tiberio}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, journal = {Machine Translation (MT)}, volume = {20}, number = {3}, pages = {167-197}, abstract = {Nowadays we are witnessing the need to translate ever increasing quantities of texts, with an ever increasing quality. The expertise and skill of professional translators is not alone entirely sufficient in order to achieve highly effective and efficient translation performance. The best way to translate very large quantities of documents, while ensuring optimal translation time and costs, is to exploit Example-Based Machine Translation (EBMT), which is devised in the aim of achieving better quality and quantity in less time, while preserving and treasuring the richness and accuracy that only human translation can achieve. In this paper we present EXTRA (EXample-based TRanslation Assistant), the EBMT system we have developed over the last few years to support the translation of texts written in Western languages. EXTRA is able to propose effective translation suggestions by relying on syntactic analysis of the text and on a rigorous, language-independent measure; the search is performed efficiently in large amounts of bilingual texts thanks to its advanced retrieval techniques. Furthermore, EXTRA does not use external knowledge requiring the intervention of users and is completely customizable and portable as it has been implemented on top of a standard DataBase Management System (DBMS). In the paper we also provide a thorough evaluation of both the effectiveness and the efficiency of our system. In particular, in order to quantify the benefits offered by EXTRA assisted translation over manual translation, we introduce a simulator implementing specifically devised statistical, process-oriented, discrete-event models.}, keywords = {Approximate search, EBMT}, pubstate = {published}, tppubtype = {article} } Nowadays we are witnessing the need to translate ever increasing quantities of texts, with an ever increasing quality. The expertise and skill of professional translators is not alone entirely sufficient in order to achieve highly effective and efficient translation performance. The best way to translate very large quantities of documents, while ensuring optimal translation time and costs, is to exploit Example-Based Machine Translation (EBMT), which is devised in the aim of achieving better quality and quantity in less time, while preserving and treasuring the richness and accuracy that only human translation can achieve. In this paper we present EXTRA (EXample-based TRanslation Assistant), the EBMT system we have developed over the last few years to support the translation of texts written in Western languages. EXTRA is able to propose effective translation suggestions by relying on syntactic analysis of the text and on a rigorous, language-independent measure; the search is performed efficiently in large amounts of bilingual texts thanks to its advanced retrieval techniques. Furthermore, EXTRA does not use external knowledge requiring the intervention of users and is completely customizable and portable as it has been implemented on top of a standard DataBase Management System (DBMS). In the paper we also provide a thorough evaluation of both the effectiveness and the efficiency of our system. In particular, in order to quantify the benefits offered by EXTRA assisted translation over manual translation, we introduce a simulator implementing specifically devised statistical, process-oriented, discrete-event models. |
44. | F. Grandi, F. Mandreoli, R. Martoglia, M. R. Scalas (2006): Efficient Management of Multi-Version XML Documents for eGovernment Applications. Proceedings of the 1st International Conference on Web Information Systems and Technologies, Revised Selected Papers, 2006 (WEBIST Selected Paper 2006), pp. 283-294, 2006. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub73, title = {Efficient Management of Multi-Version XML Documents for eGovernment Applications}, author = {F. Grandi and F. Mandreoli and R. Martoglia and M. R. Scalas}, url = {http://www.isgroup.unimore.it/article/webist05.pdf}, year = {2006}, date = {2006-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 1st International Conference on Web Information Systems and Technologies, Revised Selected Papers, 2006 (WEBIST Selected Paper 2006)}, pages = {283-294}, abstract = {This paper describes our research activities in developing efficient systems for the management of multiversion XML documents in an e-Government scenario. The application aim is to enable citizens to access personalized versions of resources, like norm texts and information made available on the Web by public administrations. In the first system developed, four temporal dimensions (publication, validity, efficacy and transaction times) were used to represent the evolution of norms in time and their resulting versioning and a stratum approach was used for its implementation on top of a relational DBMS. Recently, the multi-version management system has migrated to a different architecture (\"\"native\"\" approach) based on a multi-version XML query processor developed on purpose. Moreover, a new semantic dimension has been added to the versioning mechanism, in order to represent applicability of norms to different classes of citizens according to their digital identity. Classification of citizens is based on the management of an ontology with the deployment of semantic Web techniques. Preliminary experiments showed an encouraging performance improvement with respect to the stratum approach and a good scalability behaviour. Current work includes a more accurate modeling of the citizen\'s ontology, which could also require a redesign of the document storage scheme, and the development of a complete infrastructure for the management of the citizen\'s digital identity.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } This paper describes our research activities in developing efficient systems for the management of multiversion XML documents in an e-Government scenario. The application aim is to enable citizens to access personalized versions of resources, like norm texts and information made available on the Web by public administrations. In the first system developed, four temporal dimensions (publication, validity, efficacy and transaction times) were used to represent the evolution of norms in time and their resulting versioning and a stratum approach was used for its implementation on top of a relational DBMS. Recently, the multi-version management system has migrated to a different architecture (""native"" approach) based on a multi-version XML query processor developed on purpose. Moreover, a new semantic dimension has been added to the versioning mechanism, in order to represent applicability of norms to different classes of citizens according to their digital identity. Classification of citizens is based on the management of an ontology with the deployment of semantic Web techniques. Preliminary experiments showed an encouraging performance improvement with respect to the stratum approach and a good scalability behaviour. Current work includes a more accurate modeling of the citizen's ontology, which could also require a redesign of the document storage scheme, and the development of a complete infrastructure for the management of the citizen's digital identity. |
2005 | |
43. | F. Mandreoli, R. Martoglia, P. Tiberio (2005): Text Clustering as a Mining Task. Text Mining and its Applications to Intelligence, CRM and Knowledge Management, pp. 75-108, 2005. (Type: Incollection | Abstract | BibTeX | Tags: Approximate search) @incollection{pub9, title = {Text Clustering as a Mining Task}, author = {F. Mandreoli and R. Martoglia and P. Tiberio}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Text Mining and its Applications to Intelligence, CRM and Knowledge Management}, pages = {75-108}, abstract = {In this chapter we introduce readers to the various aspects of cluster analysis performed on textual data in a mining framework. We first provide a brief overview on the techniques and the background notions on general clustering. Then, we focus on the importance and on the goals of clustering in a text mining scenario, analyzing and describing the issues which are specific to this particular field. Effective information extraction from highly dimensional textual data, clustering algorithms specifically designed to efficiently work on very large unstructured and, possibly, hyperlinked data sets, and comprehension of the clustering output are among the covered topics.}, keywords = {Approximate search}, pubstate = {published}, tppubtype = {incollection} } In this chapter we introduce readers to the various aspects of cluster analysis performed on textual data in a mining framework. We first provide a brief overview on the techniques and the background notions on general clustering. Then, we focus on the importance and on the goals of clustering in a text mining scenario, analyzing and describing the issues which are specific to this particular field. Effective information extraction from highly dimensional textual data, clustering algorithms specifically designed to efficiently work on very large unstructured and, possibly, hyperlinked data sets, and comprehension of the clustering output are among the covered topics. |
42. | F. Mandreoli, R. Martoglia, F. Grandi, M. R. Scalas (2005): Efficient Management of Multi-Version XML Documents for eGovernment Applications. Proceedings of the 1st International Conference on Web Information Systems and Technologies, May 2005 (WEBIST 2005), pp. 283-294, Miami, USA, 2005. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub36, title = {Efficient Management of Multi-Version XML Documents for eGovernment Applications}, author = {F. Mandreoli and R. Martoglia and F. Grandi and M. R. Scalas}, url = {http://www.isgroup.unimore.it/article/webist05.pdf}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 1st International Conference on Web Information Systems and Technologies, May 2005 (WEBIST 2005)}, pages = {283-294}, address = {Miami, USA}, abstract = {This paper describes our research activities in developing efficient systems for the management of multiversion XML documents in an e-Government scenario. The application aim is to enable citizens to access personalized versions of resources, like norm texts and information made available on the Web by public administrations. In the first system developed, four temporal dimensions (publication, validity, efficacy and transaction times) were used to represent the evolution of norms in time and their resulting versioning and a stratum approach was used for its implementation on top of a relational DBMS. Recently, the multi-version management system has migrated to a different architecture (\"\"native\"\" approach) based on a multi-version XML query processor developed on purpose. Moreover, a new semantic dimension has been added to the versioning mechanism, in order to represent applicability of norms to different classes of citizens according to their digital identity. Classification of citizens is based on the management of an ontology with the deployment of semantic Web techniques. Preliminary experiments showed an encouraging performance improvement with respect to the stratum approach and a good scalability behaviour. Current work includes a more accurate modeling of the citizen\'s ontology, which could also require a redesign of the document storage scheme, and the development of a complete infrastructure for the management of the citizen\'s digital identity.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } This paper describes our research activities in developing efficient systems for the management of multiversion XML documents in an e-Government scenario. The application aim is to enable citizens to access personalized versions of resources, like norm texts and information made available on the Web by public administrations. In the first system developed, four temporal dimensions (publication, validity, efficacy and transaction times) were used to represent the evolution of norms in time and their resulting versioning and a stratum approach was used for its implementation on top of a relational DBMS. Recently, the multi-version management system has migrated to a different architecture (""native"" approach) based on a multi-version XML query processor developed on purpose. Moreover, a new semantic dimension has been added to the versioning mechanism, in order to represent applicability of norms to different classes of citizens according to their digital identity. Classification of citizens is based on the management of an ontology with the deployment of semantic Web techniques. Preliminary experiments showed an encouraging performance improvement with respect to the stratum approach and a good scalability behaviour. Current work includes a more accurate modeling of the citizen's ontology, which could also require a redesign of the document storage scheme, and the development of a complete infrastructure for the management of the citizen's digital identity. |
41. | F. Mandreoli, R. Martoglia, P. Tiberio, F. Grandi, M. R. Scalas, E. Ronchetti (2005): Personalized access to multi-version XML documents in an eGovernment scenario. Proceedings of the 13th Italian Symposium on Advanced Database Technologies, June 2005 (SEBD 2005), pp. 256-263, Bressanone (BZ), Italy, 2005. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub37, title = {Personalized access to multi-version XML documents in an eGovernment scenario}, author = {F. Mandreoli and R. Martoglia and P. Tiberio and F. Grandi and M. R. Scalas and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/sebd05.pdf}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 13th Italian Symposium on Advanced Database Technologies, June 2005 (SEBD 2005)}, pages = {256-263}, address = {Bressanone (BZ), Italy}, abstract = {In this paper, we present some results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a multiversion repository of norm texts supporting efficient and personalized access. In particular we defined a multi-version XML data model supporting both temporal versioning -essential in normative systems- and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. We describe the organization and present preliminary performance figures of a prototype system we developed.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we present some results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a multiversion repository of norm texts supporting efficient and personalized access. In particular we defined a multi-version XML data model supporting both temporal versioning -essential in normative systems- and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. We describe the organization and present preliminary performance figures of a prototype system we developed. |
40. | F. Mandreoli, R. Martoglia, P. Tiberio, F. Grandi, M. R. Scalas, E. Ronchetti (2005): Personalized Access to Multi-version Norm Texts in an eGovernment Scenario. Proceedings of the International Conference on E-Government, August 2005 (DEXA EGOV 2005), pp. 281-290, Copenhagen, Denmark, 2005. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub38, title = {Personalized Access to Multi-version Norm Texts in an eGovernment Scenario}, author = {F. Mandreoli and R. Martoglia and P. Tiberio and F. Grandi and M. R. Scalas and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/egov05.pdf}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the International Conference on E-Government, August 2005 (DEXA EGOV 2005)}, pages = {281-290}, address = {Copenhagen, Denmark}, abstract = {In this paper, we present some results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a multi-version repository of norm texts supporting efficient and personalized access. In particular we defined a multi-version XML data model supporting both temporal versioning -essential in normative systems- and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. We describe the organization and present preliminary performance figures of a prototype system we developed.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we present some results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a multi-version repository of norm texts supporting efficient and personalized access. In particular we defined a multi-version XML data model supporting both temporal versioning -essential in normative systems- and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. We describe the organization and present preliminary performance figures of a prototype system we developed. |
39. | F. Grandi, M. R. Scalas, F. Mandreoli, R. Martoglia (2005): Personalized Access to Multi-Version Documents for E-Government Applications. Proceedings of the IADIS International Conference E-Society 20, July 2005 (IADIS E-Society 2005), Qawra, Malta, 2005. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub39, title = {Personalized Access to Multi-Version Documents for E-Government Applications}, author = {F. Grandi and M. R. Scalas and F. Mandreoli and R. Martoglia}, url = {http://www.isgroup.unimore.it/article/iadis05.pdf}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the IADIS International Conference E-Society 20, July 2005 (IADIS E-Society 2005)}, address = {Qawra, Malta}, abstract = {In this paper we describe the design and implementation of two prototype systems for the efficient management of multi-version XML documents in an e-Government scenario. The application aim is to enable citizens to access personalized versions of resources, like norm texts and information made available on the Web by public administrations. In the first system developed, four temporal dimensions (validity, efficacy, transaction and publication times) were used to represent the evolution of norms in time and their resulting versioning and a \"\"stratum\"\" approach was used for its implementation on top of an object-relational DBMS. Recently, the multi-version management system has migrated to a different architecture (\"\"native\"\" approach) based on a multi-version XML query processor developed on purpose. Moreover, a new semantic dimension has been added to the versioning mechanism, in order to represent applicability of norms to different classes of citizens according to their digital identity. Classification of citizens is based on the management of an ontology with the deployment of semantic Web techniques. Preliminary experiments showed an encouraging performance improvement with respect to the \"\"stratum\"\" approach and a good scalability behavior.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we describe the design and implementation of two prototype systems for the efficient management of multi-version XML documents in an e-Government scenario. The application aim is to enable citizens to access personalized versions of resources, like norm texts and information made available on the Web by public administrations. In the first system developed, four temporal dimensions (validity, efficacy, transaction and publication times) were used to represent the evolution of norms in time and their resulting versioning and a ""stratum"" approach was used for its implementation on top of an object-relational DBMS. Recently, the multi-version management system has migrated to a different architecture (""native"" approach) based on a multi-version XML query processor developed on purpose. Moreover, a new semantic dimension has been added to the versioning mechanism, in order to represent applicability of norms to different classes of citizens according to their digital identity. Classification of citizens is based on the management of an ontology with the deployment of semantic Web techniques. Preliminary experiments showed an encouraging performance improvement with respect to the ""stratum"" approach and a good scalability behavior. |
38. | F. Mandreoli, R. Martoglia, P. Tiberio, E. Ronchetti, F. Grandi, M. R. Scalas (2005): Accesso Personalizzato a Documenti Multiversione per Applicazioni nel Settore dell'E-Government. Atti del Congresso Nazionale AICA 20, October 2005 (AICA 2005), Udine, Italia, 2005. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub40, title = {Accesso Personalizzato a Documenti Multiversione per Applicazioni nel Settore dell\'E-Government}, author = {F. Mandreoli and R. Martoglia and P. Tiberio and E. Ronchetti and F. Grandi and M. R. Scalas}, url = {http://www.isgroup.unimore.it/article/aica05.pdf}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Atti del Congresso Nazionale AICA 20, October 2005 (AICA 2005)}, address = {Udine, Italia}, abstract = {In questo lavoro viene presentata l\'attivita\' di ricerca concernente la realizzazione di sistemi prototipali per la gestione efficiente di documenti XML multiversione in uno scenario di e-Government. Lo scopo applicativo di tali sistemi e\' di permettere al cittadino l\'accesso a versioni personalizzate di risorse quali testi normativi e informazioni rese disponibili sul WEB dalle Pubbliche Amministrazioni. Per rappresentare l\'evoluzione delle norme nel tempo e il conseguente \"\"versionamento\"\" si sono usate quattro dimensioni temporali e un\'ulteriore dimensione semantica per rappresentare l\'applicabilita\' delle norme a differenti classi di cittadini, in accordo alla loro identita\' digitale. La classificazione dei cittadini e\' basata sulla gestione di un\'ontologia e l\'adozione di tecniche di Semantic WEB. L\'attuale implementazione, evoluzione di un approccio di tipo \"\"stratum\"\" (sviluppato on top di una piattaforma RDBMS), e\' basata su un approccio \"\"nativo\"\" consistente in un query processor XML sviluppato ad-hoc. Una sperimentazione preliminare ha evidenziato nel nuovo sistema buoni livelli di prestazioni e scalabilita\'.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } In questo lavoro viene presentata l'attivita' di ricerca concernente la realizzazione di sistemi prototipali per la gestione efficiente di documenti XML multiversione in uno scenario di e-Government. Lo scopo applicativo di tali sistemi e' di permettere al cittadino l'accesso a versioni personalizzate di risorse quali testi normativi e informazioni rese disponibili sul WEB dalle Pubbliche Amministrazioni. Per rappresentare l'evoluzione delle norme nel tempo e il conseguente ""versionamento"" si sono usate quattro dimensioni temporali e un'ulteriore dimensione semantica per rappresentare l'applicabilita' delle norme a differenti classi di cittadini, in accordo alla loro identita' digitale. La classificazione dei cittadini e' basata sulla gestione di un'ontologia e l'adozione di tecniche di Semantic WEB. L'attuale implementazione, evoluzione di un approccio di tipo ""stratum"" (sviluppato on top di una piattaforma RDBMS), e' basata su un approccio ""nativo"" consistente in un query processor XML sviluppato ad-hoc. Una sperimentazione preliminare ha evidenziato nel nuovo sistema buoni livelli di prestazioni e scalabilita'. |
37. | F. Mandreoli, R. Martoglia, E. Ronchetti (2005): Versatile Structural Disambiguation for Semantic-aware Applications. Proceedings of the 14th ACM International Conference on Information Knowledge and Management, November 2005 (ACM CIKM 2005), pp. 209-216, Bremen, Germany, 2005. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Structural Disambiguation, Wisdom Project) @inproceedings{pub41, title = {Versatile Structural Disambiguation for Semantic-aware Applications}, author = {F. Mandreoli and R. Martoglia and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/cikm05.pdf}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 14th ACM International Conference on Information Knowledge and Management, November 2005 (ACM CIKM 2005)}, pages = {209-216}, address = {Bremen, Germany}, abstract = {In this paper, we propose a versatile disambiguation approach which can be used to make explicit the meaning of structure based information such as XML schemas, XML document structures, web directories, and ontologies. It can be of support to the semantic-awareness of a wide range of applications, from schema matching and query rewriting to peer data management systems, from XML data clustering to ontology-based automatic annotation of web pages and query expansion. The effectiveness of the achieved results has been experimentally proved and is founded both on a flexible exploitation of the structure context, whose extraction can be tailored on the specific application needs, and of the information provided by commonly available thesauri such as WordNet.}, keywords = {Structural Disambiguation, Wisdom Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we propose a versatile disambiguation approach which can be used to make explicit the meaning of structure based information such as XML schemas, XML document structures, web directories, and ontologies. It can be of support to the semantic-awareness of a wide range of applications, from schema matching and query rewriting to peer data management systems, from XML data clustering to ontology-based automatic annotation of web pages and query expansion. The effectiveness of the achieved results has been experimentally proved and is founded both on a flexible exploitation of the structure context, whose extraction can be tailored on the specific application needs, and of the information provided by commonly available thesauri such as WordNet. |
36. | F. Mandreoli, P. Tiberio, F. Grandi (2005): Temporal Modelling and Management of Normative Documents in XML Format. Data & Knowledge Engineering (DKE), 54 (3), pp. 327-354, 2005. (Type: Journal Article | Abstract | BibTeX | Tags: Data Versioning, norma nel tempo project) @article{pub42, title = {Temporal Modelling and Management of Normative Documents in XML Format}, author = {F. Mandreoli and P. Tiberio and F. Grandi}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, journal = {Data & Knowledge Engineering (DKE)}, volume = {54}, number = {3}, pages = {327-354}, abstract = {In this paper, we present the results of a research project concerning the temporal management of normative texts in XML format. In particular, four temporal dimensions (publication, validity, efficacy and transaction times) are used to correctly represent the evolution of norms in time and their resulting versioning. Hence, we introduce a multiversion data model based on XML schema and define basic mechanisms for the maintenance and retrieval of multiversion norm texts. Finally, we describe a prototype management system which has been implemented and evaluated.}, keywords = {Data Versioning, norma nel tempo project}, pubstate = {published}, tppubtype = {article} } In this paper, we present the results of a research project concerning the temporal management of normative texts in XML format. In particular, four temporal dimensions (publication, validity, efficacy and transaction times) are used to correctly represent the evolution of norms in time and their resulting versioning. Hence, we introduce a multiversion data model based on XML schema and define basic mechanisms for the maintenance and retrieval of multiversion norm texts. Finally, we describe a prototype management system which has been implemented and evaluated. |
35. | F. Mandreoli, R. Martoglia, E. Ronchetti (2005): Improving Semantic Awareness of Knowledge-based Applications through Structural Disambiguation. Proceedings of the 2nd Italian Semantic Web Workshop, December 2005 (SWAP 2005), Trento, Italy, 2005. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Structural Disambiguation, Wisdom Project) @inproceedings{pub43, title = {Improving Semantic Awareness of Knowledge-based Applications through Structural Disambiguation}, author = {F. Mandreoli and R. Martoglia and E. Ronchetti}, url = {http://www.isgroup.unimore.it/article/swap05a.pdf}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 2nd Italian Semantic Web Workshop, December 2005 (SWAP 2005)}, address = {Trento, Italy}, abstract = {In this paper, we summarize the features of the versatile disambiguation approach we recentlty presented. Its main aim is to make explicit the meaning of structure-based information such as XML schemas, XML document structures, web directories, and ontologies. It can be of support to the semantic-awareness of a wide range of applications, from schema matching and query rewriting to peer data management systems. In this paper, we summarize the features of the versatile disambiguation approach we recentlty presented. Its main aim is to make explicit the meaning of structure-based information such as XML schemas, XML document structures, web directories, and ontologies. It can be of support to the semantic-awareness of a wide range of applications, from schema matching and query rewriting to peer data management systems, from XML data clustering to ontology-based automatic annotation of web pages and query expansion. The effectiveness of the achieved results has been experimentally proved and is founded both on a flexible exploitation of the structure context, whose extraction can be tailored on the specific application needs, and of the information provided by commonly available thesauri such as WordNet. This work is partially supported by the Italian Council co-funded project WISDOM.}, keywords = {Structural Disambiguation, Wisdom Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we summarize the features of the versatile disambiguation approach we recentlty presented. Its main aim is to make explicit the meaning of structure-based information such as XML schemas, XML document structures, web directories, and ontologies. It can be of support to the semantic-awareness of a wide range of applications, from schema matching and query rewriting to peer data management systems. In this paper, we summarize the features of the versatile disambiguation approach we recentlty presented. Its main aim is to make explicit the meaning of structure-based information such as XML schemas, XML document structures, web directories, and ontologies. It can be of support to the semantic-awareness of a wide range of applications, from schema matching and query rewriting to peer data management systems, from XML data clustering to ontology-based automatic annotation of web pages and query expansion. The effectiveness of the achieved results has been experimentally proved and is founded both on a flexible exploitation of the structure context, whose extraction can be tailored on the specific application needs, and of the information provided by commonly available thesauri such as WordNet. This work is partially supported by the Italian Council co-funded project WISDOM. |
34. | F. Grandi, F. Mandreoli, R. Martoglia, E. Ronchetti, M. R. Scalas, P. Tiberio (2005): Enhanced access to eGovernment services: temporal and semantics-aware retrieval of norms. Proceedings of the 2nd Italian Semantic Web Workshop, December 2005 (SWAP 2005), Trento, Italy, 2005. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub44, title = {Enhanced access to eGovernment services: temporal and semantics-aware retrieval of norms}, author = {F. Grandi and F. Mandreoli and R. Martoglia and E. Ronchetti and M. R. Scalas and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/swap05b.pdf}, year = {2005}, date = {2005-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 2nd Italian Semantic Web Workshop, December 2005 (SWAP 2005)}, address = {Trento, Italy}, abstract = {In this paper, we summarize the results of an ongoing research involving the design and implementation of a multi-version repository of norm texts supporting efficient and personalized access in an eGovernment scenario. The research activity is entitled \"\"Semantic web techniques for the management of digital identity and the access to norms\"\". In the context of a complete and modular infrastructure, we defined a multiversion XML data model and developed an XML query processor supporting both temporal and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. The whole infrastructure, which we plan to complete in the near future, will integrate the query answering component with several auxiliary services, including automatic citizen identification and classification and computer-aided update of the repository data.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we summarize the results of an ongoing research involving the design and implementation of a multi-version repository of norm texts supporting efficient and personalized access in an eGovernment scenario. The research activity is entitled ""Semantic web techniques for the management of digital identity and the access to norms"". In the context of a complete and modular infrastructure, we defined a multiversion XML data model and developed an XML query processor supporting both temporal and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. The whole infrastructure, which we plan to complete in the near future, will integrate the query answering component with several auxiliary services, including automatic citizen identification and classification and computer-aided update of the repository data. |
2004 | |
33. | F. Mandreoli, R. Martoglia, P. Zezula (2004): Unordered XML Pattern Matching with Tree Signatures. Atti del Dodicesimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2004 (SEBD 2004), pp. 78-85, S. Margherita di Pula, Italy, 2004. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Twig Query Processing) @inproceedings{pub1, title = {Unordered XML Pattern Matching with Tree Signatures}, author = {F. Mandreoli and R. Martoglia and P. Zezula}, url = {http://www.isgroup.unimore.it/article/sebd04sig.pdf}, year = {2004}, date = {2004-01-01}, urldate = {2013-06-12}, booktitle = {Atti del Dodicesimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2004 (SEBD 2004)}, pages = {78-85}, address = {S. Margherita di Pula, Italy}, abstract = {We propose an efficient approach for finding relevant XML data twigs defined by unordered query tree specifications. We use the tree signatures as the index structure and find qualifying patterns through integration of structurally consistent query path qualifications. An efficient technique is proposed and its implementation tested on real-life data collections.}, keywords = {Twig Query Processing}, pubstate = {published}, tppubtype = {inproceedings} } We propose an efficient approach for finding relevant XML data twigs defined by unordered query tree specifications. We use the tree signatures as the index structure and find qualifying patterns through integration of structurally consistent query path qualifications. An efficient technique is proposed and its implementation tested on real-life data collections. |
32. | F. Mandreoli, R. Martoglia (2004): Exploiting related digital library corpora with query rewriting. Atti del Dodicesimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2004 (SEBD 2004), pp. 362-369, S. Margherita di Pula, Italy, 2004. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, ECD Project) @inproceedings{pub3, title = {Exploiting related digital library corpora with query rewriting}, author = {F. Mandreoli and R. Martoglia}, url = {http://www.isgroup.unimore.it/article/sebd04smart.pdf}, year = {2004}, date = {2004-01-01}, urldate = {2013-06-12}, booktitle = {Atti del Dodicesimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2004 (SEBD 2004)}, pages = {362-369}, address = {S. Margherita di Pula, Italy}, abstract = {In this paper, we present the preliminary results of the ongoing research activity we are carrying out in the context of approximate XML query answering when the schemas of the XML documents are available. The method we propose involves a preliminary schema matching process, which automatically identifies the semantic and structural similarities between the schema elements to be used in the subsequent operation of query rewriting, in which a query written on a source schema is automatically rewritten in order to be compatible with the other useful XML documents. The proposed approach has been implemented in a web service, named XML S3MART, which is part of the open architecture proposed in the ongoing Italian CNR co-funded ECD Project.}, keywords = {Approximate search, Data Sharing, ECD Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we present the preliminary results of the ongoing research activity we are carrying out in the context of approximate XML query answering when the schemas of the XML documents are available. The method we propose involves a preliminary schema matching process, which automatically identifies the semantic and structural similarities between the schema elements to be used in the subsequent operation of query rewriting, in which a query written on a source schema is automatically rewritten in order to be compatible with the other useful XML documents. The proposed approach has been implemented in a web service, named XML S3MART, which is part of the open architecture proposed in the ongoing Italian CNR co-funded ECD Project. |
31. | F. Mandreoli, R. Martoglia, P. Zezula (2004): Tree Signatures and Unordered XML Pattern Matching. Proceedings of 30th Conference on Current Trends in Theory and Practice of Computer Science, January 2004 (SOFSEM 2004), pp. 122-139, Merin, Czech Republic, 2004. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Twig Query Processing) @inproceedings{pub4, title = {Tree Signatures and Unordered XML Pattern Matching}, author = {F. Mandreoli and R. Martoglia and P. Zezula}, url = {http://www.isgroup.unimore.it/article/sofsem04.pdf}, year = {2004}, date = {2004-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of 30th Conference on Current Trends in Theory and Practice of Computer Science, January 2004 (SOFSEM 2004)}, pages = {122-139}, address = {Merin, Czech Republic}, abstract = {We propose an efficient approach for finding relevant XML data twigs defined by unordered query tree specifications. We use the tree signatures as the index structure and find qualifying patterns through integration of structurally consistent query path qualifications. An efficient algorithm is proposed and its implementation tested on real-life data collections.}, keywords = {Twig Query Processing}, pubstate = {published}, tppubtype = {inproceedings} } We propose an efficient approach for finding relevant XML data twigs defined by unordered query tree specifications. We use the tree signatures as the index structure and find qualifying patterns through integration of structurally consistent query path qualifications. An efficient algorithm is proposed and its implementation tested on real-life data collections. |
30. | F. Mandreoli, R. Martoglia, P. Tiberio (2004): A Document Comparison Scheme for Secure Duplicate Detection. International Journal on Digital Libraries (IJDL), 4 (3), pp. 223-244, 2004. (Type: Journal Article | Abstract | BibTeX | Tags: Approximate search) @article{pub8, title = {A Document Comparison Scheme for Secure Duplicate Detection}, author = {F. Mandreoli and R. Martoglia and P. Tiberio}, year = {2004}, date = {2004-01-01}, urldate = {2013-06-12}, journal = {International Journal on Digital Libraries (IJDL)}, volume = {4}, number = {3}, pages = {223-244}, abstract = {The ever-growing volumes of textual information from various sources have fostered the development of digital libraries, making digital content readily accessible but also easy for malicious users to plagiarize, thus giving rise to security problems. In this paper, we introduce a duplicate detection scheme that is able to determine, with a particularly high accuracy, the degree to which one document is similar to another. Our pairwise document comparison scheme detects the resemblance between the content of documents by considering document chunks, representing contexts of words selected from the text. The resulting duplicate detection technique presents a good level of security in the protection of intellectual property while improving the availability of the data stored in the digital library and the correctness of the search results. Finally, the paper addresses efficiency and scalability issues by introducing new data reduction techniques.}, keywords = {Approximate search}, pubstate = {published}, tppubtype = {article} } The ever-growing volumes of textual information from various sources have fostered the development of digital libraries, making digital content readily accessible but also easy for malicious users to plagiarize, thus giving rise to security problems. In this paper, we introduce a duplicate detection scheme that is able to determine, with a particularly high accuracy, the degree to which one document is similar to another. Our pairwise document comparison scheme detects the resemblance between the content of documents by considering document chunks, representing contexts of words selected from the text. The resulting duplicate detection technique presents a good level of security in the protection of intellectual property while improving the availability of the data stored in the digital library and the correctness of the search results. Finally, the paper addresses efficiency and scalability issues by introducing new data reduction techniques. |
29. | F. Mandreoli, R. Martoglia, P. Tiberio (2004): Approximate Query Answering for a Heterogeneous XML Document Base. Proceedings of the 5th International Conference on Web Information Systems Engineering, November 2004 (WISE 2004), pp. 337-351, Brisbane, AU, 2004. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, Data Sharing, ECD Project) @inproceedings{pub10, title = {Approximate Query Answering for a Heterogeneous XML Document Base}, author = {F. Mandreoli and R. Martoglia and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/wise04.pdf}, year = {2004}, date = {2004-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 5th International Conference on Web Information Systems Engineering, November 2004 (WISE 2004)}, pages = {337-351}, address = {Brisbane, AU}, abstract = {In this paper, we deal with the problem of effective search and query answering in heterogeneous web document bases containing documents in XML format of which the schemas are available. We propose a new solution for the structural approximation of the submitted queries which, in a preliminary schema matching process, is able to automatically identify the similarities between the involved schemas and to use them in the query processing phase, when a query written on a source schema is automatically rewritten in order to be compatible with the other useful XML documents. The proposed approach has been implemented in a web service and can deliver middleware rewriting services in any open-architecture XML repository system offering advanced search capabilities.}, keywords = {Approximate search, Data Sharing, ECD Project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we deal with the problem of effective search and query answering in heterogeneous web document bases containing documents in XML format of which the schemas are available. We propose a new solution for the structural approximation of the submitted queries which, in a preliminary schema matching process, is able to automatically identify the similarities between the involved schemas and to use them in the query processing phase, when a query written on a source schema is automatically rewritten in order to be compatible with the other useful XML documents. The proposed approach has been implemented in a web service and can deliver middleware rewriting services in any open-architecture XML repository system offering advanced search capabilities. |
28. | F. Mandreoli, P. Tiberio, F. Grandi, M. R. Scalas (2004): Management of the Citizen's Digital Identity and Access to Multi-version Norm Texts on the Semantic Web. Proceedings of the International Symposium on Challenges in the Internet and Interdisciplinary Research, August 2004 (IPSI 2004), Pescara, Italy, 2004. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, EGov Project) @inproceedings{pub35, title = {Management of the Citizen\'s Digital Identity and Access to Multi-version Norm Texts on the Semantic Web}, author = {F. Mandreoli and P. Tiberio and F. Grandi and M. R. Scalas}, url = {http://www.isgroup.unimore.it/article/ipsi04.pdf}, year = {2004}, date = {2004-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the International Symposium on Challenges in the Internet and Interdisciplinary Research, August 2004 (IPSI 2004)}, address = {Pescara, Italy}, abstract = {This paper describes an ongoing research project involving the implementation of e-Government services on the Semantic Web. In particular, the project is aimed at managing the \"\"digital identity\"\" of citizens on the Internet, enabling them to benefit from \"\"personalized\"\" versions of the online services offered by the Public Administration, which can improve and optimize their involvement in the e-Governance process. The kind of service we will consider is the selective access to norm texts available on Web repositories. The project requires the definition and maintenance of a citizen\'s ontology, the semantic markup and versioning of the stored norm texts which takes into account the actual applicability to different classes of citizens, the definition and enactment of Web services for the reconstruction of the citizen\'s digital identity and its classification with respect to the ontology, the design and implementation of a legal document management system for the selective access to personalized norm versions.}, keywords = {Data Versioning, EGov Project}, pubstate = {published}, tppubtype = {inproceedings} } This paper describes an ongoing research project involving the implementation of e-Government services on the Semantic Web. In particular, the project is aimed at managing the ""digital identity"" of citizens on the Internet, enabling them to benefit from ""personalized"" versions of the online services offered by the Public Administration, which can improve and optimize their involvement in the e-Governance process. The kind of service we will consider is the selective access to norm texts available on Web repositories. The project requires the definition and maintenance of a citizen's ontology, the semantic markup and versioning of the stored norm texts which takes into account the actual applicability to different classes of citizens, the definition and enactment of Web services for the reconstruction of the citizen's digital identity and its classification with respect to the ontology, the design and implementation of a legal document management system for the selective access to personalized norm versions. |
2003 | |
27. | F. Mandreoli, R. Martoglia, P. Tiberio (2003): Un Metodo per il Riconoscimento di Duplicati in Collezioni di Documenti. Atti dell'Undicesimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2003 (SEBD 2003), pp. 131-146, Cetraro, Italy, 2003. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search) @inproceedings{pub2, title = {Un Metodo per il Riconoscimento di Duplicati in Collezioni di Documenti}, author = {F. Mandreoli and R. Martoglia and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/sebd03.pdf}, year = {2003}, date = {2003-01-01}, urldate = {2013-06-12}, booktitle = {Atti dell'Undicesimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2003 (SEBD 2003)}, pages = {131-146}, address = {Cetraro, Italy}, abstract = {I recenti avanzamenti nella potenza di calcolo e nelle telecomunicazioni hanno creato le giuste condizioni per la diffusione globale di enormi moli di informazioni elettroniche e di nuovi strumenti per l\'analisi del loro contenuto, sollevando problemi di information overload e, in particolare, di duplicate detection. I duplicati, cioe\' documenti molto simili che contengono approssimativamente le stesse informazioni, degradano l\'efficacia e l\'efficienza delle ricerche e, spesso, costituiscono anche violazioni di copyright. In questo articolo introduciamo DANCER (Document ANalysis and Comparison ExpeRt), un sistema completo di duplicate detection che sfrutta idee innovative nell\'ambito dell\'information retrieval per l\'identificazione dei documenti duplicati, utilizzando algoritmi e misure di similarita\' inedite in questo campo e sufficientemente fini da ottenere una buona efficacia nella maggior parte delle applicazioni. Inoltre, il sistema propone diverse nuove tecniche di data reduction che permettono di ridurre sia il tempo di esecuzione che lo spazio richiesto per la memorizzazione dei dati, senza compromettere la buona qualita\' dei risultati.}, keywords = {Approximate search}, pubstate = {published}, tppubtype = {inproceedings} } I recenti avanzamenti nella potenza di calcolo e nelle telecomunicazioni hanno creato le giuste condizioni per la diffusione globale di enormi moli di informazioni elettroniche e di nuovi strumenti per l'analisi del loro contenuto, sollevando problemi di information overload e, in particolare, di duplicate detection. I duplicati, cioe' documenti molto simili che contengono approssimativamente le stesse informazioni, degradano l'efficacia e l'efficienza delle ricerche e, spesso, costituiscono anche violazioni di copyright. In questo articolo introduciamo DANCER (Document ANalysis and Comparison ExpeRt), un sistema completo di duplicate detection che sfrutta idee innovative nell'ambito dell'information retrieval per l'identificazione dei documenti duplicati, utilizzando algoritmi e misure di similarita' inedite in questo campo e sufficientemente fini da ottenere una buona efficacia nella maggior parte delle applicazioni. Inoltre, il sistema propone diverse nuove tecniche di data reduction che permettono di ridurre sia il tempo di esecuzione che lo spazio richiesto per la memorizzazione dei dati, senza compromettere la buona qualita' dei risultati. |
26. | F. Mandreoli, R. Martoglia, P. Tiberio (2003): Exploiting multi-lingual text potentialities in EBMT systems. Proceedings of the 13th IEEE International Workshop on Research Issues in Data Engineering: Multi Lingual Information Management, March 2003 (IEEE RIDE-MLIM 2003), pp. 9-15, Hyderabad, India, 2003. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, EBMT) @inproceedings{pub5, title = {Exploiting multi-lingual text potentialities in EBMT systems}, author = {F. Mandreoli and R. Martoglia and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/ride03.pdf}, year = {2003}, date = {2003-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 13th IEEE International Workshop on Research Issues in Data Engineering: Multi Lingual Information Management, March 2003 (IEEE RIDE-MLIM 2003)}, pages = {9-15}, address = {Hyderabad, India}, abstract = {Translating documents from a source to a target language is a repetitive activity. The attempt to automate such a difficult task has been a long-term scientific dream. Among the several types of approaches in Machine Translation (MT), one of the most promising paradigms is Example-Based Machine Translation (EBMT). An EBMT system translates by analogy, using past translations to translate other, similar source-language material into the target language. In this paper we introduce EXTRA (EXample-based TRanslation Assistant), a complete EBMT system that exploits some innovative ideas in information retrieval and multilingual text management to effectively and efficiently extract useful suggestions from past translations and present them to the translator. This work has been developed as a joint work with the LOGOS group, a worldwide leader in multilingual document translation.}, keywords = {Approximate search, EBMT}, pubstate = {published}, tppubtype = {inproceedings} } Translating documents from a source to a target language is a repetitive activity. The attempt to automate such a difficult task has been a long-term scientific dream. Among the several types of approaches in Machine Translation (MT), one of the most promising paradigms is Example-Based Machine Translation (EBMT). An EBMT system translates by analogy, using past translations to translate other, similar source-language material into the target language. In this paper we introduce EXTRA (EXample-based TRanslation Assistant), a complete EBMT system that exploits some innovative ideas in information retrieval and multilingual text management to effectively and efficiently extract useful suggestions from past translations and present them to the translator. This work has been developed as a joint work with the LOGOS group, a worldwide leader in multilingual document translation. |
25. | F. Mandreoli, F. Grandi (2003): A Formal Model for Temporal Schema Versioning in Object-Oriented Databases. Data & Knowledge Engineering (DKE), 46 (2), pp. 123-167, 2003. (Type: Journal Article | BibTeX | Tags: Schema Versioning) @article{pub22, title = {A Formal Model for Temporal Schema Versioning in Object-Oriented Databases}, author = {F. Mandreoli and F. Grandi}, year = {2003}, date = {2003-01-01}, urldate = {2013-06-12}, journal = {Data & Knowledge Engineering (DKE)}, volume = {46}, number = {2}, pages = {123-167}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {article} } |
24. | F. Mandreoli, E. Franconi, A. Artale (2003): Description Logics for Modeling Dynamic Information. Logics for Emerging Applications of Databases, pp. 239-275, 2003. (Type: Incollection | BibTeX | Tags: Schema Versioning) @incollection{pub32, title = {Description Logics for Modeling Dynamic Information}, author = {F. Mandreoli and E. Franconi and A. Artale}, year = {2003}, date = {2003-01-01}, urldate = {2013-06-12}, booktitle = {Logics for Emerging Applications of Databases}, pages = {239-275}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {incollection} } |
23. | F. Mandreoli, F. Grandi, M. Bergonzini, P. Tiberio (2003): A temporal data model and management system for normative texts in XML format. Proceedings of the 5th ACM CIKM International Workshop on Web Information and Data Management, November 2003 (WIDM 2003), pp. 29-36, New Orleans, USA, 2003. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, norma nel tempo project) @inproceedings{pub33, title = {A temporal data model and management system for normative texts in XML format}, author = {F. Mandreoli and F. Grandi and M. Bergonzini and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/widm03.pdf}, year = {2003}, date = {2003-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 5th ACM CIKM International Workshop on Web Information and Data Management, November 2003 (WIDM 2003)}, pages = {29-36}, address = {New Orleans, USA}, abstract = {In this paper, we present the results of an on-going research activity concerning the temporal management of normative texts in XML format. In particular, four temporal dimensions (publication, validity, efficacy and transaction times) are used to correctly represent the evolution of norms in time and their resulting versioning. Hence, we introduce a multiversion data model based on XML schema and defined basic mechanisms for the management of norm texts. Finally, we describe a prototype management system which has been implemented and evaluated.}, keywords = {Data Versioning, norma nel tempo project}, pubstate = {published}, tppubtype = {inproceedings} } In this paper, we present the results of an on-going research activity concerning the temporal management of normative texts in XML format. In particular, four temporal dimensions (publication, validity, efficacy and transaction times) are used to correctly represent the evolution of norms in time and their resulting versioning. Hence, we introduce a multiversion data model based on XML schema and defined basic mechanisms for the management of norm texts. Finally, we describe a prototype management system which has been implemented and evaluated. |
22. | F. Mandreoli, P. Tiberio, F. Grandi, M. Bergonzini (2003): A temporal data model and system architecture for the management of normative texts. Atti dell'Undicesimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2003 (SEBD 2003), pp. 169-178, Cetraro (CS), Italy, 2003. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning, norma nel tempo project) @inproceedings{pub34, title = {A temporal data model and system architecture for the management of normative texts}, author = {F. Mandreoli and P. Tiberio and F. Grandi and M. Bergonzini}, url = {http://www.isgroup.unimore.it/article/sebd03b.pdf}, year = {2003}, date = {2003-01-01}, urldate = {2013-06-12}, booktitle = {Atti dell'Undicesimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2003 (SEBD 2003)}, pages = {169-178}, address = {Cetraro (CS), Italy}, abstract = {I recenti avanzamenti nella potenza di calcolo e nelle telecomunicazioni hanno creato le giuste condizioni per la diffusione globale di enormi moli di informazioni elettroniche e di nuovi strumenti per l\'analisi del loro contenuto, sollevando problemi di information overload e, in particolare, di duplicate detection. I duplicati, cioe\' documenti molto simili che contengono approssimativamente le stesse informazioni, degradano l\'efficacia e l\'efficienza delle ricerche e, spesso, costituiscono anche violazioni di copyright. In questo articolo introduciamo DANCER (Document ANalysis and Comparison ExpeRt), un sistema completo di duplicate detection che sfrutta idee innovative nell\'ambito dell\'information retrieval per l\'identificazione dei documenti duplicati, utilizzando algoritmi e misure di similarit`a inedite in questo campo e sufficientemente fini da ottenere una buona efficacia nella maggior parte delle applicazioni. Inoltre, il sistema propone diverse nuove tecniche di data reduction che permettono di ridurre sia il tempo di esecuzione che lo spazio richiesto per la memorizzazione dei dati, senza compromettere la buona qualita\' dei risultati.}, keywords = {Data Versioning, norma nel tempo project}, pubstate = {published}, tppubtype = {inproceedings} } I recenti avanzamenti nella potenza di calcolo e nelle telecomunicazioni hanno creato le giuste condizioni per la diffusione globale di enormi moli di informazioni elettroniche e di nuovi strumenti per l'analisi del loro contenuto, sollevando problemi di information overload e, in particolare, di duplicate detection. I duplicati, cioe' documenti molto simili che contengono approssimativamente le stesse informazioni, degradano l'efficacia e l'efficienza delle ricerche e, spesso, costituiscono anche violazioni di copyright. In questo articolo introduciamo DANCER (Document ANalysis and Comparison ExpeRt), un sistema completo di duplicate detection che sfrutta idee innovative nell'ambito dell'information retrieval per l'identificazione dei documenti duplicati, utilizzando algoritmi e misure di similarit`a inedite in questo campo e sufficientemente fini da ottenere una buona efficacia nella maggior parte delle applicazioni. Inoltre, il sistema propone diverse nuove tecniche di data reduction che permettono di ridurre sia il tempo di esecuzione che lo spazio richiesto per la memorizzazione dei dati, senza compromettere la buona qualita' dei risultati. |
2002 | |
21. | F. Mandreoli, R. Martoglia, P. Tiberio (2002): Searching Similar (Sub)Sentences for Example-Based Machine Translation. Atti del Decimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2002 (SEBD 2002), pp. 208-221, Portoferraio, Italy, 2002. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, EBMT) @inproceedings{pub6, title = {Searching Similar (Sub)Sentences for Example-Based Machine Translation}, author = {F. Mandreoli and R. Martoglia and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/sebd02.pdf}, year = {2002}, date = {2002-01-01}, urldate = {2013-06-12}, booktitle = {Atti del Decimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2002 (SEBD 2002)}, pages = {208-221}, address = {Portoferraio, Italy}, abstract = {Translation is a repetitive activity. The attempt to automate such a difficult task has been a long-term scientific dream; in the past years research in this field has acquired a growing interest, making some forms of Machine Translation (MT) a reality. Among the several types of approaches in MT, one of the most promising paradigms is MAHT and, in particular, example-Based Machine Translation (EBMT). An EBMT system translates by analogy, using past translations to translate other, similar sourcelanguage sentences into the target language. The basic premise is that, if a previously translated sentence occurs again, the same translation is likely to be correct. In this paper, we propose a solution based on a purely syntactic approach for searching similar sentences and parts of them in an EBMT system; the underlying similarity measure is based on the similarity between sequence of terms such that the sentences most close to a given one are those who maintain most of the original form and contents. The system efficiently retrieves and ranks the most similar sentences available and, when no useful suggestion exists, it proceeds with the retrieval of similar parts. We opted for a design that would require minimal changes to existing databases and whose similarity measure and search algorithms are completely independent from the involved languages. This work has been developed as a joint work with LOGOS S.p.A., a worldwide leader in multilingual document translation.}, keywords = {Approximate search, EBMT}, pubstate = {published}, tppubtype = {inproceedings} } Translation is a repetitive activity. The attempt to automate such a difficult task has been a long-term scientific dream; in the past years research in this field has acquired a growing interest, making some forms of Machine Translation (MT) a reality. Among the several types of approaches in MT, one of the most promising paradigms is MAHT and, in particular, example-Based Machine Translation (EBMT). An EBMT system translates by analogy, using past translations to translate other, similar sourcelanguage sentences into the target language. The basic premise is that, if a previously translated sentence occurs again, the same translation is likely to be correct. In this paper, we propose a solution based on a purely syntactic approach for searching similar sentences and parts of them in an EBMT system; the underlying similarity measure is based on the similarity between sequence of terms such that the sentences most close to a given one are those who maintain most of the original form and contents. The system efficiently retrieves and ranks the most similar sentences available and, when no useful suggestion exists, it proceeds with the retrieval of similar parts. We opted for a design that would require minimal changes to existing databases and whose similarity measure and search algorithms are completely independent from the involved languages. This work has been developed as a joint work with LOGOS S.p.A., a worldwide leader in multilingual document translation. |
20. | F. Mandreoli, R. Martoglia, P. Tiberio (2002): A Syntactic Approach for Searching Similarities within Sentences. Proceedings of the 11th ACM International Conference on Information Knowledge and Management, November 2002 (ACM CIKM 2002), pp. 635-637, McLean, VA, USA, 2002. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Approximate search, EBMT) @inproceedings{pub7, title = {A Syntactic Approach for Searching Similarities within Sentences}, author = {F. Mandreoli and R. Martoglia and P. Tiberio}, url = {http://www.isgroup.unimore.it/article/cikm02.pdf}, year = {2002}, date = {2002-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 11th ACM International Conference on Information Knowledge and Management, November 2002 (ACM CIKM 2002)}, pages = {635-637}, address = {McLean, VA, USA}, abstract = {Textual data is the main electronic form of knowledge representation. Sentences, meant as logic units of meaningful word sequences, can be considered its backbone. In this paper, we propose a solution based on a purely syntactic approach for searching similarities within sentences, named approximate sub2sequence matching. This process being very time consuming, efficiency in retrieving the most similar parts available in large repositories of textual data is ensured by making use of new filtering techniques. As far as the design of the system is concerned, we chose a solution that allows us to deploy approximate sub2sequence matching without changing the underlying database.}, keywords = {Approximate search, EBMT}, pubstate = {published}, tppubtype = {inproceedings} } Textual data is the main electronic form of knowledge representation. Sentences, meant as logic units of meaningful word sequences, can be considered its backbone. In this paper, we propose a solution based on a purely syntactic approach for searching similarities within sentences, named approximate sub2sequence matching. This process being very time consuming, efficiency in retrieving the most similar parts available in large repositories of textual data is ensured by making use of new filtering techniques. As far as the design of the system is concerned, we chose a solution that allows us to deploy approximate sub2sequence matching without changing the underlying database. |
19. | F. Mandreoli, F. Grandi (2002): The Valid Web: un'infrastruttura XML/XSL per la gestione temporale di documenti Web. Bollettino d'Informazioni del Centro Ricerche Informatiche per i Beni Culturali (CRIBECU) (Bollettino), 1 (1), 2002. (Type: Journal Article | BibTeX | Tags: Data Versioning) @article{pub26, title = {The Valid Web: un\'infrastruttura XML/XSL per la gestione temporale di documenti Web}, author = {F. Mandreoli and F. Grandi}, year = {2002}, date = {2002-01-01}, urldate = {2013-06-12}, journal = {Bollettino d'Informazioni del Centro Ricerche Informatiche per i Beni Culturali (CRIBECU) (Bollettino)}, volume = {1}, number = {1}, keywords = {Data Versioning}, pubstate = {published}, tppubtype = {article} } |
18. | F. Mandreoli, S. Bergamaschi, D. Beneventano, et. al. (2002): Semantic Integration and Query Optimization of Heterogeneous Data Sources. Proceedings of the 1st OOIS Workshop on Efficient Web-based Information Systems, September 2002 (EWIS 2002), Montpellier, France, 2002. (Type: Inproceeding | BibTeX | Tags: Data Sharing) @inproceedings{pub31, title = {Semantic Integration and Query Optimization of Heterogeneous Data Sources}, author = {F. Mandreoli and S. Bergamaschi and D. Beneventano and et. al.}, year = {2002}, date = {2002-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 1st OOIS Workshop on Efficient Web-based Information Systems, September 2002 (EWIS 2002)}, address = {Montpellier, France}, keywords = {Data Sharing}, pubstate = {published}, tppubtype = {inproceedings} } |
2001 | |
17. | F. Mandreoli, F. Grandi, M. R. Scalas, J. F. Roddick (2001): Beyond Schema Versioning: A Flexible Model for Spatio-Temporal Schema selection. Geoinformatica (Geoinformatica), 5 (1), pp. 33-50, 2001. (Type: Journal Article | BibTeX | Tags: Schema Versioning) @article{pub20, title = {Beyond Schema Versioning: A Flexible Model for Spatio-Temporal Schema selection}, author = {F. Mandreoli and F. Grandi and M. R. Scalas and J. F. Roddick}, year = {2001}, date = {2001-01-01}, urldate = {2013-06-12}, journal = {Geoinformatica (Geoinformatica)}, volume = {5}, number = {1}, pages = {33-50}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {article} } |
16. | F. Mandreoli, F. Grandi (2001): Effective Representation and Efficient Management of Indeterminate Dates. Proceedings of the 8th International Symposium on Temporal Representation and Reasoning, June 2001 (TIME 2001), Cividale del Friuli, Italy, 2001. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning) @inproceedings{pub27, title = {Effective Representation and Efficient Management of Indeterminate Dates}, author = {F. Mandreoli and F. Grandi}, url = {http://www.isgroup.unimore.it/article/time01.pdf}, year = {2001}, date = {2001-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 8th International Symposium on Temporal Representation and Reasoning, June 2001 (TIME 2001)}, address = {Cividale del Friuli, Italy}, abstract = {Management of indeterminate temporal expressions is useful in a wide range of applications, from designing and querying temporal databases to knowledge representation and reasoning in artificial intelligence. In this paper, we focus on the representation and management of indeterminate dates, corresponding to a common use of temporal indeterminacy which can be found in (historical) texts written in natural language, as in expressions like: around 1624, near the end of the fourteenth century, etc. In this context, we adapt and improve the probabilistic approach designed for the TSQL2 language and further developed by Dyreson and Snodgrass, and show how it can be effectively and efficiently adopted for the management of indeterminate dates.}, keywords = {Data Versioning}, pubstate = {published}, tppubtype = {inproceedings} } Management of indeterminate temporal expressions is useful in a wide range of applications, from designing and querying temporal databases to knowledge representation and reasoning in artificial intelligence. In this paper, we focus on the representation and management of indeterminate dates, corresponding to a common use of temporal indeterminacy which can be found in (historical) texts written in natural language, as in expressions like: around 1624, near the end of the fourteenth century, etc. In this context, we adapt and improve the probabilistic approach designed for the TSQL2 language and further developed by Dyreson and Snodgrass, and show how it can be effectively and efficiently adopted for the management of indeterminate dates. |
15. | F. Mandreoli, F. Grandi (2001): Codifica XML e Gestione di Informazione Temporale in Fonti Storiche Digitalizzate di Grandi Dimensioni. Atti del XXXIX Congresso Annuale AIC, September 2001 (AICA 2001), Como, Italy, 2001. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning) @inproceedings{pub28, title = {Codifica XML e Gestione di Informazione Temporale in Fonti Storiche Digitalizzate di Grandi Dimensioni}, author = {F. Mandreoli and F. Grandi}, url = {http://www.isgroup.unimore.it/article/aica01.pdf}, year = {2001}, date = {2001-01-01}, urldate = {2013-06-12}, booktitle = {Atti del XXXIX Congresso Annuale AIC, September 2001 (AICA 2001)}, address = {Como, Italy}, abstract = {The paper deals with the deployment of XML-related technologies in Cultural Heritage applications concerning the encoding of temporal semantics in the digital version of historical documents. The present research is included in the context of a project aimed at publishing on the Web an XML-based electronic edition of the Repetti dictionary (XIX century), extremely interesting for historical and archeological studies of the Tuscany Middle Ages. In particular, we introduce a proposal for the uniform encoding and classification of temporal information embedded in textual sources which are characterized by indeterminancy,multiple granularities and calendars. Our proposal is based on the extension of the probabilistic approach (a la TSQL2) to indeterminancy, with the introduction of piecewise-constant probability distributions, which are semantically correct and particularly efficient in their management. The paper also contains a brief description of two tools whose prototypes are carried on: a user-friendly tool for the computer-aided encoding of temporal XML documents and a system for the management of XML documents consisting in a temporal search engine accessible via standardWeb browsers.}, keywords = {Data Versioning}, pubstate = {published}, tppubtype = {inproceedings} } The paper deals with the deployment of XML-related technologies in Cultural Heritage applications concerning the encoding of temporal semantics in the digital version of historical documents. The present research is included in the context of a project aimed at publishing on the Web an XML-based electronic edition of the Repetti dictionary (XIX century), extremely interesting for historical and archeological studies of the Tuscany Middle Ages. In particular, we introduce a proposal for the uniform encoding and classification of temporal information embedded in textual sources which are characterized by indeterminancy,multiple granularities and calendars. Our proposal is based on the extension of the probabilistic approach (a la TSQL2) to indeterminancy, with the introduction of piecewise-constant probability distributions, which are semantically correct and particularly efficient in their management. The paper also contains a brief description of two tools whose prototypes are carried on: a user-friendly tool for the computer-aided encoding of temporal XML documents and a system for the management of XML documents consisting in a temporal search engine accessible via standardWeb browsers. |
14. | F. Mandreoli, F. Grandi (2001): The ""XML/Repetti"" Project: Encoding and Manipulation of Temporal Information in Historical Text Sources. Proceedings of the International Cultural Heritage Informatics Meeting, September 2001 (ICHIM 2001), Milano, Italy, 2001. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning) @inproceedings{pub29, title = {The \"\"XML/Repetti\"\" Project: Encoding and Manipulation of Temporal Information in Historical Text Sources}, author = {F. Mandreoli and F. Grandi}, url = {http://www.isgroup.unimore.it/article/ichim01.pdf}, year = {2001}, date = {2001-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the International Cultural Heritage Informatics Meeting, September 2001 (ICHIM 2001)}, address = {Milano, Italy}, abstract = {The paper deals with the deployment of XML-related technologies in Cultural Heritage applications concerning the encoding of temporal semantics in the digital version of historical documents. Since written sources have often the same importance as material evidence in medieval archaeology, our approach can be applied to the development of tools for the support of archaeological research. In previous work, we developed an XML/XSL infrastructure called The Valid Web for the definition and management of historical information within Web documents. In this paper we describe the application and extension of such an approach to the realization of the electronic version of Repetti\'s historical-geographical dictionary of Tuscany. The extension concerns the uniform management of temporal indeterminacy, the use of multiple calendars and granularities and the proposed solutions have been inspired by similar research done for temporal query languages. From the user viewpoint, the proposed XML extensions allow the addition of historical metainformation to the encoded text sources and their intelligent temporal navigation via standard Web browsers. The project also involves the definition of optimized search algorithms, storage and temporal indexing of XML-encoded Repetti\'s Dictionary items, implementation of a prototype. As a byproduct, also a tool for computer-aided temporal XML-encoding of text sources will be developed to be used by Cultural Heritage operators (e.g. archaeology researchers).}, keywords = {Data Versioning}, pubstate = {published}, tppubtype = {inproceedings} } The paper deals with the deployment of XML-related technologies in Cultural Heritage applications concerning the encoding of temporal semantics in the digital version of historical documents. Since written sources have often the same importance as material evidence in medieval archaeology, our approach can be applied to the development of tools for the support of archaeological research. In previous work, we developed an XML/XSL infrastructure called The Valid Web for the definition and management of historical information within Web documents. In this paper we describe the application and extension of such an approach to the realization of the electronic version of Repetti's historical-geographical dictionary of Tuscany. The extension concerns the uniform management of temporal indeterminacy, the use of multiple calendars and granularities and the proposed solutions have been inspired by similar research done for temporal query languages. From the user viewpoint, the proposed XML extensions allow the addition of historical metainformation to the encoded text sources and their intelligent temporal navigation via standard Web browsers. The project also involves the definition of optimized search algorithms, storage and temporal indexing of XML-encoded Repetti's Dictionary items, implementation of a prototype. As a byproduct, also a tool for computer-aided temporal XML-encoding of text sources will be developed to be used by Cultural Heritage operators (e.g. archaeology researchers). |
13. | F. Mandreoli, S. Bergamaschi, D. Beneventano (2001): Extensional Knowledge for Semantic Query Optimization in a Mediator Based System. Foundation of Models for Information Integration, September 2001 (FMII 2001), Viterbo, Italy, 2001. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Sharing) @inproceedings{pub30, title = {Extensional Knowledge for Semantic Query Optimization in a Mediator Based System}, author = {F. Mandreoli and S. Bergamaschi and D. Beneventano}, url = {http://www.isgroup.unimore.it/article/fmii01.pdf}, year = {2001}, date = {2001-01-01}, urldate = {2013-06-12}, booktitle = {Foundation of Models for Information Integration, September 2001 (FMII 2001)}, address = {Viterbo, Italy}, abstract = {Query processing in global information systems integrating multiple heterogeneous sources is a challenging issue in relation to the effective extraction of information available on-line. In this paper we propose intelligent, tool-supported techniques for querying global information systems integrating both structured and semistructured data sources. The techniques have been developed in the environment of a data integration, wrapper/mediator based system, MOMIS, and try to achieve the goal of optimized query reformulation w.r.t local sources. The developed techniques rely on the availability of integration knowledge whose semantics is expressed in terms of description logics. Integration knowledge includes local source schemata, a virtual mediated schema and its mapping descriptions, that is semantic mappings w.r.t. the underlying sources both at the intensional and extensional level. Mapping descriptions, obtained as a result of the semi-automatic integration process of multiple heterogeneous sources developed for the MOMIS system, include, unlike previous data integration proposals, extensional intra/interschema knowledge. Extensional knowledge is exploited to perform semantic query optimization in a mediator based system as it allows to devise an optimized query reformulation method. The techniques are under development in the MOMIS system but can be applied, in general, to data integration systems including extensional intra/interschema knowledge in mapping descriptions.}, keywords = {Data Sharing}, pubstate = {published}, tppubtype = {inproceedings} } Query processing in global information systems integrating multiple heterogeneous sources is a challenging issue in relation to the effective extraction of information available on-line. In this paper we propose intelligent, tool-supported techniques for querying global information systems integrating both structured and semistructured data sources. The techniques have been developed in the environment of a data integration, wrapper/mediator based system, MOMIS, and try to achieve the goal of optimized query reformulation w.r.t local sources. The developed techniques rely on the availability of integration knowledge whose semantics is expressed in terms of description logics. Integration knowledge includes local source schemata, a virtual mediated schema and its mapping descriptions, that is semantic mappings w.r.t. the underlying sources both at the intensional and extensional level. Mapping descriptions, obtained as a result of the semi-automatic integration process of multiple heterogeneous sources developed for the MOMIS system, include, unlike previous data integration proposals, extensional intra/interschema knowledge. Extensional knowledge is exploited to perform semantic query optimization in a mediator based system as it allows to devise an optimized query reformulation method. The techniques are under development in the MOMIS system but can be applied, in general, to data integration systems including extensional intra/interschema knowledge in mapping descriptions. |
2000 | |
12. | F. Mandreoli, F. Grandi, M. R. Scalas (2000): A Generalized Modeling Framework for Schema Versioning Support. Proceedings of the 11th Australasian Database Conference, January 2000 (ADC 2000), Camberrra, Australia, 2000. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Schema Versioning) @inproceedings{pub15, title = {A Generalized Modeling Framework for Schema Versioning Support}, author = {F. Mandreoli and F. Grandi and M. R. Scalas}, url = {http://www.isgroup.unimore.it/article/adc00.pdf}, year = {2000}, date = {2000-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 11th Australasian Database Conference, January 2000 (ADC 2000)}, address = {Camberrra, Australia}, abstract = {Advanced object-oriented applications require the management of schema versions, in order to cope with changes in the structure of the stored data. Two types of versioning have been separately considered so far: branching and temporal. The former arose in application domains like CAD/CAM and software engineering, where different solutions have been proposed to support design schema versions (consolidated versions). The latter concerns temporal databases, where some works considered temporal schema versioning to fulfil advanced needs of other typical objectoriented applications like GIS and the multimedia ones. In this work, we propose a general model which integrates the two approaches by supporting both design and temporal schema versions. The model is provided with a complete set of schema change primitives for full-fledged version manipulation whose semantics is described in the paper.}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {inproceedings} } Advanced object-oriented applications require the management of schema versions, in order to cope with changes in the structure of the stored data. Two types of versioning have been separately considered so far: branching and temporal. The former arose in application domains like CAD/CAM and software engineering, where different solutions have been proposed to support design schema versions (consolidated versions). The latter concerns temporal databases, where some works considered temporal schema versioning to fulfil advanced needs of other typical objectoriented applications like GIS and the multimedia ones. In this work, we propose a general model which integrates the two approaches by supporting both design and temporal schema versions. The model is provided with a complete set of schema change primitives for full-fledged version manipulation whose semantics is described in the paper. |
11. | F. Mandreoli, E. Franconi, F. Grandi (2000): A General Framework for Evolving Schemata Support. Atti dell'ottavo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2000 (SEBD 2000), pp. 371-386, L'Aquila, Italy, 2000. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Schema Versioning) @inproceedings{pub16, title = {A General Framework for Evolving Schemata Support}, author = {F. Mandreoli and E. Franconi and F. Grandi}, url = {http://www.isgroup.unimore.it/article/sebd00a.pdf}, year = {2000}, date = {2000-01-01}, urldate = {2013-06-12}, booktitle = {Atti dell'ottavo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2000 (SEBD 2000)}, pages = {371-386}, address = {L'Aquila, Italy}, abstract = {In this paper a semantic approach for the specification and the management of databases with evolving schemata is introduced. It is shown how a general object-oriented model for schema-versioning and evolution can be formalised; how the semantics of schema change operations can be defined; how interesting reasoning tasks can be supported, based on an encoding in description logics. }, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {inproceedings} } In this paper a semantic approach for the specification and the management of databases with evolving schemata is introduced. It is shown how a general object-oriented model for schema-versioning and evolution can be formalised; how the semantics of schema change operations can be defined; how interesting reasoning tasks can be supported, based on an encoding in description logics. |
10. | F. Mandreoli, E. Franconi, F. Grandi (2000): A Semantic Approach for Schema Evolution and Versioning in Object-Oriented Databases. Proceedings of the 6th International Conference on Rules and Objects in Databases, July 2000 (DOOD 2000), London, UK, 2000. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Schema Versioning) @inproceedings{pub17, title = {A Semantic Approach for Schema Evolution and Versioning in Object-Oriented Databases}, author = {F. Mandreoli and E. Franconi and F. Grandi}, url = {http://www.isgroup.unimore.it/article/dood00.pdf}, year = {2000}, date = {2000-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 6th International Conference on Rules and Objects in Databases, July 2000 (DOOD 2000)}, address = {London, UK}, abstract = {In this paper a semantic approach for the specification and the management of databases with evolving schemata is introduced. It is shown how a general object-oriented model for schema versioning and evolution can be formalized; how the semantics of schema change operations can be defined; how interesting reasoning tasks can be supported, based on an encoding in description logics.}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {inproceedings} } In this paper a semantic approach for the specification and the management of databases with evolving schemata is introduced. It is shown how a general object-oriented model for schema versioning and evolution can be formalized; how the semantics of schema change operations can be defined; how interesting reasoning tasks can be supported, based on an encoding in description logics. |
9. | F. Mandreoli, E. Franconi, F. Grandi (2000): Schema evolution and versioning: a logical and computational characterisation. Database schema evolution and meta-modeling, September 2000 (DEMM 2000), Dagstuhl, Germany, 2000. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Schema Versioning) @inproceedings{pub18, title = {Schema evolution and versioning: a logical and computational characterisation}, author = {F. Mandreoli and E. Franconi and F. Grandi}, url = {http://www.isgroup.unimore.it/article/demm00.pdf}, year = {2000}, date = {2000-01-01}, urldate = {2013-06-12}, booktitle = {Database schema evolution and meta-modeling, September 2000 (DEMM 2000)}, address = {Dagstuhl, Germany}, abstract = {In this paper we study the logical and computational properties of schema evolution and versioning support in object-oriented databases. To this end, we present the formalisation of a general model for an object base with evolving schemata and define the semantics of the provided schema change operations. We will then sketch how the encoding of such a framework in a suitable Description Logic will allow the introduction and solution of interesting reasoning tasks at global database and single schema version levels.}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we study the logical and computational properties of schema evolution and versioning support in object-oriented databases. To this end, we present the formalisation of a general model for an object base with evolving schemata and define the semantics of the provided schema change operations. We will then sketch how the encoding of such a framework in a suitable Description Logic will allow the introduction and solution of interesting reasoning tasks at global database and single schema version levels. |
8. | F. Mandreoli, J. F. Roddick, et. al. (2000): Evolution and Change in Data Management - Issues and Directions. SIGMOD Record (SIGMOD), 29 (1), pp. 21-25, 2000. (Type: Journal Article | BibTeX | Tags: Schema Versioning) @article{pub19, title = {Evolution and Change in Data Management - Issues and Directions}, author = {F. Mandreoli and J. F. Roddick and et. al.}, year = {2000}, date = {2000-01-01}, urldate = {2013-06-12}, journal = {SIGMOD Record (SIGMOD)}, volume = {29}, number = {1}, pages = {21-25}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {article} } |
7. | F. Mandreoli, F. Grandi (2000): The Valid Web. Proceedings of the 7th International Conference on Extending Database Technology, March 2000 (EDBT 2000), Costance, Germany, 2000. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning) @inproceedings{pub23, title = {The Valid Web}, author = {F. Mandreoli and F. Grandi}, url = {http://www.isgroup.unimore.it/article/edbt00.pdf}, year = {2000}, date = {2000-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 7th International Conference on Extending Database Technology, March 2000 (EDBT 2000)}, address = {Costance, Germany}, abstract = {The Valid Web is a software prototype implementing temporal extensions of the World Wide Web. The temporal dimension of interest is the valid time, which represents the evolution of data with respect of the real-world (or virtual) environment they describe. The prototype consists of a Web site browsablewith MS Internet Explorer 5 (Ie5), which allows the selective processing of HTML/XML documents containing historical information or temporal data. The base techniques employed in the prototype design and development, which derive from the temporal database theory, are the adoption of data timestamping and temporal selection operators for the creation and management of Web pages, respectively. The implementation relies on an XML/XSL infrastructure for the support of valid time whose main features are the following: the aoption of new XML tags for document timestamping; an XML schema to define well-formed temporal documents; an XSL stylesheet for selective filtering of temporal documents; the introduction of a validity context for temporal browsing (navigation and querying); a friendly user-interface for the management of the validity context.}, keywords = {Data Versioning}, pubstate = {published}, tppubtype = {inproceedings} } The Valid Web is a software prototype implementing temporal extensions of the World Wide Web. The temporal dimension of interest is the valid time, which represents the evolution of data with respect of the real-world (or virtual) environment they describe. The prototype consists of a Web site browsablewith MS Internet Explorer 5 (Ie5), which allows the selective processing of HTML/XML documents containing historical information or temporal data. The base techniques employed in the prototype design and development, which derive from the temporal database theory, are the adoption of data timestamping and temporal selection operators for the creation and management of Web pages, respectively. The implementation relies on an XML/XSL infrastructure for the support of valid time whose main features are the following: the aoption of new XML tags for document timestamping; an XML schema to define well-formed temporal documents; an XSL stylesheet for selective filtering of temporal documents; the introduction of a validity context for temporal browsing (navigation and querying); a friendly user-interface for the management of the validity context. |
6. | F. Mandreoli, F. Grandi (2000): Un'infrastruttura XML/XSL per la gestione temporale di documenti e dati in ambiente Web. Atti dell'ottavo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2000 (SEBD 2000), pp. 227-242, L'Aquila, Italy, 2000. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning) @inproceedings{pub24, title = {Un\'infrastruttura XML/XSL per la gestione temporale di documenti e dati in ambiente Web}, author = {F. Mandreoli and F. Grandi}, url = {http://www.isgroup.unimore.it/article/sebd00b.pdf}, year = {2000}, date = {2000-01-01}, urldate = {2013-06-12}, booktitle = {Atti dell'ottavo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 2000 (SEBD 2000)}, pages = {227-242}, address = {L'Aquila, Italy}, abstract = {In questo lavoro presentiamo una estensione temporale del Web per il supporto e la gestione del tempo di validita\', definita attraverso una infrastruttura XML/XSL. Tale estensione consente la definizione esplicita di informazione temporale all\'interno di pagine Web (documenti HTML o XML), i cui contenuti possono cosi\' essere acceduti e fruiti selettivamente in base alla loro validita\'. Con la soluzione proposta, agendo su di un contesto di navigazione temporale, e\' possibile \"\"viaggiare mnel tempo\"\" in un ambiente virtuale dato, attraverso un qualunque browser che riconosca codice XML. Dal punto di vista dell\'utente tale funzionalita\' consente, ad esempio, di ritagliare percorsi di visita personalizzata, circoscritti ad una particolare epoca, all\'interno di un museo virtuale o di una biblioteca storica digitale, oppure di visualizzare l\'evoluzione attraverso epoche successive di un sito archeologico, oppure anche di accedere selettivamente a serie storiche di dati (es. quotazioni di borsa), edizioni passate di pubblicazioni on-line e quanto altro possa essere organizzato secondo la dimensione temporale. In aggiunta a tali funzionalita\' di navigazione, l\'infrastruttura proposta puo\' anche essere impiegata, in maniera immediata, per la gestione di dati semistrutturati codificati in XML, ponendo le basi per la gestione di dati temporali e lo sviluppo di linguaggi di interrogazione temporale per XML. Le estensioni del Web proposte sono state sperimentate su di un prototipo software che mostra due esempi applicativi: la realizzazione di un sito Web temporale (museo virtuale) e la gestione di dati XML temporali con funzionalita\' di query di tipo TSQL2.}, keywords = {Data Versioning}, pubstate = {published}, tppubtype = {inproceedings} } In questo lavoro presentiamo una estensione temporale del Web per il supporto e la gestione del tempo di validita', definita attraverso una infrastruttura XML/XSL. Tale estensione consente la definizione esplicita di informazione temporale all'interno di pagine Web (documenti HTML o XML), i cui contenuti possono cosi' essere acceduti e fruiti selettivamente in base alla loro validita'. Con la soluzione proposta, agendo su di un contesto di navigazione temporale, e' possibile ""viaggiare mnel tempo"" in un ambiente virtuale dato, attraverso un qualunque browser che riconosca codice XML. Dal punto di vista dell'utente tale funzionalita' consente, ad esempio, di ritagliare percorsi di visita personalizzata, circoscritti ad una particolare epoca, all'interno di un museo virtuale o di una biblioteca storica digitale, oppure di visualizzare l'evoluzione attraverso epoche successive di un sito archeologico, oppure anche di accedere selettivamente a serie storiche di dati (es. quotazioni di borsa), edizioni passate di pubblicazioni on-line e quanto altro possa essere organizzato secondo la dimensione temporale. In aggiunta a tali funzionalita' di navigazione, l'infrastruttura proposta puo' anche essere impiegata, in maniera immediata, per la gestione di dati semistrutturati codificati in XML, ponendo le basi per la gestione di dati temporali e lo sviluppo di linguaggi di interrogazione temporale per XML. Le estensioni del Web proposte sono state sperimentate su di un prototipo software che mostra due esempi applicativi: la realizzazione di un sito Web temporale (museo virtuale) e la gestione di dati XML temporali con funzionalita' di query di tipo TSQL2. |
5. | F. Mandreoli, F. Grandi (2000): The Valid Web: An XML/XSL Infrastructure for Temporal Management of Web Documents. Proceedings of the 1st International Conference on Advances in Information Systems, October 2000 (ADVIS 2000), Izmir, Turkey, 2000. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Data Versioning) @inproceedings{pub25, title = {The Valid Web: An XML/XSL Infrastructure for Temporal Management of Web Documents}, author = {F. Mandreoli and F. Grandi}, url = {http://www.isgroup.unimore.it/article/advis00.pdf}, year = {2000}, date = {2000-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the 1st International Conference on Advances in Information Systems, October 2000 (ADVIS 2000)}, address = {Izmir, Turkey}, abstract = {In this paper we present a temporal extension of the World Wide Web based on a complete XML/XSL infrastructure to support valid time. The proposed technique enables the explicit definition of temporal information within HTML/XML documents, whose contents can then be selectively accessed according to their valid time. By acting on a navigation validity context, the proposed solution makes it possible to \"\"travel in time\"\" in a given virtual environment with any XML-compliant browser; this allows, for instance, to cut personalized visit routes for a specific epoch in a virtual museum or a digital historical library, to visualize the evolution of an archeological site through sucessive ages, to selectively access past issues of magazines, to browse historical time series (e.g. stock quote archives), etc. The proposed Web extensions have been tested on a demo prototype showing, as application example, the functionalities of a temporal museum.}, keywords = {Data Versioning}, pubstate = {published}, tppubtype = {inproceedings} } In this paper we present a temporal extension of the World Wide Web based on a complete XML/XSL infrastructure to support valid time. The proposed technique enables the explicit definition of temporal information within HTML/XML documents, whose contents can then be selectively accessed according to their valid time. By acting on a navigation validity context, the proposed solution makes it possible to ""travel in time"" in a given virtual environment with any XML-compliant browser; this allows, for instance, to cut personalized visit routes for a specific epoch in a virtual museum or a digital historical library, to visualize the evolution of an archeological site through sucessive ages, to selectively access past issues of magazines, to browse historical time series (e.g. stock quote archives), etc. The proposed Web extensions have been tested on a demo prototype showing, as application example, the functionalities of a temporal museum. |
1999 | |
4. | F. Mandreoli, F. Grandi, M. R. Scalas (1999): Un Nuovo Modello per la Gestione di Versioni di Progetto e Versioni Temporali di Schema nelle Basi di Dati Object-Oriented. Atti del Settimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 1999 (SEBD 1999), pp. 403-417, Como, Italy, 1999. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Schema Versioning) @inproceedings{pub12, title = {Un Nuovo Modello per la Gestione di Versioni di Progetto e Versioni Temporali di Schema nelle Basi di Dati Object-Oriented}, author = {F. Mandreoli and F. Grandi and M. R. Scalas}, url = {http://www.isgroup.unimore.it/article/sebd99.pdf}, year = {1999}, date = {1999-01-01}, urldate = {2013-06-12}, booktitle = {Atti del Settimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, June 1999 (SEBD 1999)}, pages = {403-417}, address = {Como, Italy}, abstract = {Il problema della gestione di versioni di schema (schema versioning) nelle basi di dati object-oriented) e' stato studiato nell'ambito di due principali filoni di ricerca. Il primo di essi riguarda sistemi statici (non temporali) per i quali esistono numerose soluzioni per il supporto di versioni progettuali di schema (versioni consolidate) sulla base delle esigenze di domini applicativi quali il CAD/CAM e l'ingegneria del software. Il secondo filone di ricerca riguarda invece le basi di dati temporali. In questo ambito, per soddisfare le richieste avanzate da altre tipiche applicazioni object-oriented, quali GIS e multimediale, sono state presentate alcune proposte di gestione di versioni temporali di schema. In questo lavoro ci proponiamo di integrare i due approcci introducendo un modello generalizzato orientato agli oggetti per la gestione di versioni di schema sia progettuali sia temporali. Il modello proposto estende le possibilita' applicative di un singolo sistema arricchendo l'espressivita' delle versioni e le potenzialita' dischiuse dal loro trattamento. A tal fine e' stato formalmente definito un insieme completo di primitive per il cambiamento di schema il cui utilizzo sara' esemplificato nel lavoro.}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {inproceedings} } Il problema della gestione di versioni di schema (schema versioning) nelle basi di dati object-oriented) e' stato studiato nell'ambito di due principali filoni di ricerca. Il primo di essi riguarda sistemi statici (non temporali) per i quali esistono numerose soluzioni per il supporto di versioni progettuali di schema (versioni consolidate) sulla base delle esigenze di domini applicativi quali il CAD/CAM e l'ingegneria del software. Il secondo filone di ricerca riguarda invece le basi di dati temporali. In questo ambito, per soddisfare le richieste avanzate da altre tipiche applicazioni object-oriented, quali GIS e multimediale, sono state presentate alcune proposte di gestione di versioni temporali di schema. In questo lavoro ci proponiamo di integrare i due approcci introducendo un modello generalizzato orientato agli oggetti per la gestione di versioni di schema sia progettuali sia temporali. Il modello proposto estende le possibilita' applicative di un singolo sistema arricchendo l'espressivita' delle versioni e le potenzialita' dischiuse dal loro trattamento. A tal fine e' stato formalmente definito un insieme completo di primitive per il cambiamento di schema il cui utilizzo sara' esemplificato nel lavoro. |
3. | F. Mandreoli, F. Grandi, M. R. Scalas, J. F. Roddick (1999): Towards a Model for Spatio-Temporal Schema Selection. Proceedings of the DEXA Workshop on Spatio-Temporal Data Models and Languages, August 1999 (STDM 1999), Florence, Italy, 1999. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Schema Versioning) @inproceedings{pub13, title = {Towards a Model for Spatio-Temporal Schema Selection}, author = {F. Mandreoli and F. Grandi and M. R. Scalas and J. F. Roddick}, url = {http://www.isgroup.unimore.it/article/stdml99.pdf}, year = {1999}, date = {1999-01-01}, urldate = {2013-06-12}, booktitle = { Proceedings of the DEXA Workshop on Spatio-Temporal Data Models and Languages, August 1999 (STDM 1999)}, address = {Florence, Italy}, abstract = {Schema versioning provides a mechanism for handling change in the structure of database systems and has been investigated widely, both in the context of static and temporal databases. With the growing interest in spatial and spatio-temporal data as well as the mechanisms for holding such data, the spatial context within which data is formatted also becomes an issue. This paper presents a generalised model that accommodates schema versioning within static, temporal, spatial and spatio-temporal relational and object-oriented databases.}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {inproceedings} } Schema versioning provides a mechanism for handling change in the structure of database systems and has been investigated widely, both in the context of static and temporal databases. With the growing interest in spatial and spatio-temporal data as well as the mechanisms for holding such data, the spatial context within which data is formatted also becomes an issue. This paper presents a generalised model that accommodates schema versioning within static, temporal, spatial and spatio-temporal relational and object-oriented databases. |
2. | F. Mandreoli, F. Grandi (1999): ODMG language extensions for generalized schema versioning support. Proceedings of the Fist ER Int'l Workshop on Evolution and Change in Data Management, November 1999 (ECDM 1999), Paris, France, 1999. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Schema Versioning) @inproceedings{pub14, title = {ODMG language extensions for generalized schema versioning support}, author = {F. Mandreoli and F. Grandi}, url = {http://www.isgroup.unimore.it/article/ecdm99.pdf}, year = {1999}, date = {1999-01-01}, urldate = {2013-06-12}, booktitle = {Proceedings of the Fist ER Int'l Workshop on Evolution and Change in Data Management, November 1999 (ECDM 1999)}, address = {Paris, France}, abstract = {The management of different schema versions is required in long-lived database systems to accomplish data structural changes and represent their history. Once a suitable data model for schema versioning support has been defined, appropriate extensions must also be introduced in the data definition and manipulation languages. Such an extension is aimed at making the versioning facilities available at user-interface level and is the basis for the development of advanced multi-schema applications. In this paper we present extensions to the definition and manipulation language of the standard object-oriented data model ODMG for a generalized schema versioning support. To this end, two versioning modalities will be considered in a single powerful system: temporal versioning and management of alternative design versions. As far as the temporal components are concerned, the proposed extensions of ODL and OQL will be consistent with the TSQL-temporal query language.}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {inproceedings} } The management of different schema versions is required in long-lived database systems to accomplish data structural changes and represent their history. Once a suitable data model for schema versioning support has been defined, appropriate extensions must also be introduced in the data definition and manipulation languages. Such an extension is aimed at making the versioning facilities available at user-interface level and is the basis for the development of advanced multi-schema applications. In this paper we present extensions to the definition and manipulation language of the standard object-oriented data model ODMG for a generalized schema versioning support. To this end, two versioning modalities will be considered in a single powerful system: temporal versioning and management of alternative design versions. As far as the temporal components are concerned, the proposed extensions of ODL and OQL will be consistent with the TSQL-temporal query language. |
1998 | |
1. | F. Mandreoli (1998): Temporal Schema Versioning for OODBs (abstract). Integrating Spatial and Temporal Databases, December 1998 (Dagstuhl-Seminar-Report), Dagstuhl, Germany, 1998. (Type: Inproceeding | Links | BibTeX | Tags: Schema Versioning) @inproceedings{pub11, title = {Temporal Schema Versioning for OODBs (abstract)}, author = {F. Mandreoli}, url = {http://www.isgroup.unimore.it/article/dagstuhl98.pdf}, year = {1998}, date = {1998-01-01}, urldate = {2013-06-12}, booktitle = {Integrating Spatial and Temporal Databases, December 1998 (Dagstuhl-Seminar-Report)}, address = {Dagstuhl, Germany}, keywords = {Schema Versioning}, pubstate = {published}, tppubtype = {inproceedings} } |