Issue: Features for NIF URIs

From NLP2RDF-Wiki

Jump to: navigation, search

This page is one of the Issues, which are currently under discussion

Timline, NIF 2.0 and todos in this wiki (help is appreciated, registration is ) e
All is welcome, see here how to Get Involved, see here for a post about the roadmap
Collection of use cases, requirements and Category:Issues <- current phase, your best chance to contribute >> updating NIF 2.0 Draft >> discussion until all issues are resolved >> implementation and infrastructure >> publication of NIF 2.0 (hopefully end of 2012 )
* Bug tracker for software: Pages from https://nlp2rdf.org that need migration: tutorial docu, tutorial challenges, list of implementations, demos and development help, tutorial challenge explanation send an , if you are not yet listed on the Involved People page

1 Introduction
2 Examples
- 2.1 LinkedData HTML
3 Collection of possible features Please add any you can think of
4 Examples
5 Options
6 Links

Introduction

We are currently collecting possible features. Note that this discussion is independent from the syntax of identifiers, which is discussed here: Issue:_Syntax_for_NIF_URIs. It is easy to see, that '#offset_717_729' and '#char=717,12' can have equivalent meaning.

Examples

Some examples to make the discussion more concrete.

LinkedData HTML

First occurrence of the string Semantic Web on http://www.w3.org/DesignIssues/LinkedData.html . Note that the document:

has a total of 26610 characters
is an information resource
the string Semantic Web occurs for the first time at position 717, has 12 characters and ends at 729
we use the prefix ld: for 'http://www.w3.org/DesignIssues/LinkedData.html#'

Collection of possible features Please add any you can think of

desc=offset or desc=hash
- used in NIF 1.0 to tell how the fragment identifier should be parsed
beginIndex = 717
endIndex = 729
offset = 717,729
length = 12
- length of the string
encoding = utf-8
text = Semantic%200Web
hash_with_context = 4_12_711ffecc1815feff00f9314eeb6eaa12_Semantic%20Web
- produced by md5 ("The (Semantic Web) isn");
hash_of_text = 103b850abb89c034ccc2a2d2e6756fe3
- produced by md5 ("Semanic Web");
contextLength = 4
leftContext = 'The '
rightContext = ' isn'
regex = Semantic\wWeb
- selects all occurences that matches the regex

Issue: Features for NIF URIs

Contents

Introduction

Examples

LinkedData HTML

Collection of possible features Please add any you can think of

Examples

Options

Links

Previous discussions

Current NIF 1.0 URIs

Literature

Other (W3C and IETF)

Personal tools

Namespaces

Variants

Views

Actions

Search

Back to main:

NIF 2.0 Draft

Documentation

ToDo - Help Wanted

Navigation

Toolbox