Show simple item record

dc.contributor.authorO'SULLIVAN, DECLANen
dc.contributor.authorBRENNAN, ROBen
dc.contributor.authorWALSHE, BRIANen
dc.date.accessioned2013-08-12T14:00:01Z
dc.date.available2013-08-12T14:00:01Z
dc.date.createdApril 16-20, 2012en
dc.date.issued2012en
dc.date.submitted2012en
dc.identifier.citationBrian Walshe, Rob Brennan, Declan O'Sullivan, Correspondence Pattern Attribute Selection for Consumption of Federated Data Sources, Distributed Autonomous Network Management Systems/Network Operation and Management Symposium, Maui, Hawaii,. USA, April 16-20, 2012, 2012, 1234 - 1240en
dc.identifier.otherYen
dc.identifier.urihttp://hdl.handle.net/2262/66995
dc.descriptionPUBLISHEDen
dc.descriptionMaui, Hawaii,. USAen
dc.description.abstractWhen consuming data from federated domains, it is often necessary to identify the relationships that exist between the data schemas used in each domain. Discovering the exact nature of these relationships is difficult due to data set schema heterogeneity. Prior work has focused on inter-domain class equivalence. However it is not always possible to find an equivalent class in both schemas. For example, when instances are modeled as classes in one domain (e.g. router type) but as the attribute values of a single class in the other domain (e.g. router interface). This paper investigates whether when classifying instances in one data set against a second schema, it may be more useful to use some attribute (or attribute group) other than the original class type, to perform this classification. A machine- learning based classification approach to appropriate attribute selection is presented and its operation is evaluated using two large data-sets available on the web as Linked Data. The classification problem is compounded by the less formal semantics of Linked Data when compared to full ontologies but this also highlights the strength of our approach to dealing with noisy or under-specified data-sets and schemas. The experimental results show that our attribute selection approach is capable of discovering appropriate mappings for cases where the correspondence is conditioned on one attribute and that information gain provides a suitable scoring function for selection of correspondence patterns to describe these complex attribute- based mappings.en
dc.description.sponsorshipScience Foundation Ireland FAME Strategic Research Cluster (award No. 08/SRC/I1408)en
dc.format.extent1234en
dc.format.extent1240en
dc.language.isoenen
dc.rightsYen
dc.subject.otherData schemas
dc.titleCorrespondence Pattern Attribute Selection for Consumption of Federated Data Sourcesen
dc.title.alternativeDistributed Autonomous Network Management Systems/Network Operation and Management Symposiumen
dc.typeConference Paperen
dc.type.supercollectionscholarly_publicationsen
dc.type.supercollectionrefereed_publicationsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/walshebren
dc.identifier.peoplefinderurlhttp://people.tcd.ie/osulldpsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/rbrennaen
dc.identifier.rssinternalid82487en
dc.contributor.sponsorScience Foundation Ireland (SFI)en
dc.contributor.sponsorGrantNumber08/SRC/I1408en


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record