[media-types] Fwd: "Cool" Linked Data URIs for all IANA mediatypes?

Stian Soiland-Reyes <soiland-reyes@cs.manchester.ac.uk> Tue, 14 October 2014 01:07 UTC

Return-Path: <stian@mygrid.org.uk>
X-Original-To: media-types@ietfa.amsl.com
Delivered-To: media-types@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 558971A1A76 for <media-types@ietfa.amsl.com>; Mon, 13 Oct 2014 18:07:29 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 4.22
X-Spam-Level: ****
X-Spam-Status: No, score=4.22 tagged_above=-999 required=5 tests=[BAYES_20=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FM_FORGED_GMAIL=0.622, J_CHICKENPOX_26=0.6, J_CHICKENPOX_35=0.6, J_CHICKENPOX_36=0.6, J_CHICKENPOX_45=0.6, J_CHICKENPOX_47=0.6, J_CHICKENPOX_48=0.6, SPF_PASS=-0.001] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id S-XwtTSCqwo8 for <media-types@ietfa.amsl.com>; Mon, 13 Oct 2014 18:07:26 -0700 (PDT)
Received: from mail-oi0-x235.google.com (mail-oi0-x235.google.com [IPv6:2607:f8b0:4003:c06::235]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 9B9C31A004C for <media-types@ietf.org>; Mon, 13 Oct 2014 18:07:26 -0700 (PDT)
Received: by mail-oi0-f53.google.com with SMTP id v63so15088573oia.12 for <media-types@ietf.org>; Mon, 13 Oct 2014 18:07:25 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=mygrid.org.uk; s=google; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:content-type; bh=Uhse4M4IJq6gk3NuMVMRcGDbLu9G/QsPRmK5E1s64t0=; b=e6LwjXTT/jMM5CQXVUavliem8bglOSnNe1hvhfj++ku3jzdXIkY/xw30NPmlJY2Ahq yeg2cx56ItWTCvz9p8v7SjXmmycednq3Vs+S4sz3zqrAr1xsjjEfeLIH4fPsE3X+E05R QZ3PWLyWj5hJeAGLHaLt0Z6TZ98Djoc4xa32o=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:sender:in-reply-to:references:from :date:message-id:subject:to:content-type; bh=Uhse4M4IJq6gk3NuMVMRcGDbLu9G/QsPRmK5E1s64t0=; b=iIQMazxN5BBG9wAE8wtCZlDLjKicVc+Y8ISDynpvSm3Atd8DZqvwVpmZucIzqY8lMn 8xqH0wuRFcy0JU4qewiSm0IMslLr8gdT9kkdCzHDIvcJVF9dbGiaSv41/XWC4R/oloRJ ajTxYL7aT9sEt75Y38x7tHb0wky///kExZODHHoLouPmhMIbC4elAdnxzQnlvdV6htIR 9aBDc4sMT8hNyLRg8gyBVZl348ZH4jIMGEKyTnVp8z6D+XZV64upIs99bdmKYyDrxkz6 ZvMmGy67PW9R2u7WO9epYaQzR8AayI0ItG+CPWAXk1+gm4QXAIpNdwMULwTi8iJo0QFl 0nig==
X-Gm-Message-State: ALoCoQkbdVlpRFJxYHnCmNJnVAMiW/hBGkIWpOzPHCReBlWNsoBxcg1QBsHkClcovX78VBTpEody
X-Received: by 10.202.129.147 with SMTP id c141mr22448oid.59.1413248845354; Mon, 13 Oct 2014 18:07:25 -0700 (PDT)
MIME-Version: 1.0
Sender: stian@mygrid.org.uk
Received: by 10.76.153.229 with HTTP; Mon, 13 Oct 2014 18:07:05 -0700 (PDT)
In-Reply-To: <CAPRnXt=hQHvkemE5T7ibA1AWj6dx9QBja79GHzEfC_K-WsjOHg@mail.gmail.com>
References: <CAPRnXt=hQHvkemE5T7ibA1AWj6dx9QBja79GHzEfC_K-WsjOHg@mail.gmail.com>
From: Stian Soiland-Reyes <soiland-reyes@cs.manchester.ac.uk>
Date: Tue, 14 Oct 2014 02:07:05 +0100
X-Google-Sender-Auth: 1j4V-WbMo1Qh03Sin2k3rbw5qEA
Message-ID: <CAPRnXtkZO=YbmNL51YTEg3LQpXdzUxk9ti7=XbABN03uFJQhgQ@mail.gmail.com>
To: media-types@ietf.org
Content-Type: text/plain; charset="UTF-8"
Archived-At: http://mailarchive.ietf.org/arch/msg/media-types/DW4gkpa7O_xPW9qqf3xJ2SA1_n4
Subject: [media-types] Fwd: "Cool" Linked Data URIs for all IANA mediatypes?
X-BeenThere: media-types@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "IANA mailing list for reviewing Media Type \(MIME Type, Content Type\) registration requests." <media-types.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/media-types>, <mailto:media-types-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/media-types/>
List-Post: <mailto:media-types@ietf.org>
List-Help: <mailto:media-types-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/media-types>, <mailto:media-types-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 14 Oct 2014 01:07:30 -0000

tl;dr: Ensure every registered media type has a corresponding URI,
avoid 404 for http://www.iana.org/assignments/media-types/text/plain


In Linked Data [1][2] we need identifiers for common resources. These
identifiers are typically so-called "Cool URIs" [3].

Internet Media Types are commonly referred to when describing resource
metadata, particularly in the case of packaging or redistributing
resources.


Many newer mediatypes have individual pages at iana, like:

  http://www.iana.org/assignments/media-types/application/pdf

Put many mediatypes do not have such a "cool URI", e.g.

  http://www.iana.org/assignments/media-types/text/plain
  404 Not Found


I therefore feel that as a third-party I am not at liberty to "invent"
such URIs by simply appending the registered (or unregistered!) type
to an URI template

  http://www.iana.org/assignments/media-types/{type}/{subtype}



Additionally there is no RDF information available for media-types
using content-negotiation at iana.org - some of the URIs give HTML,
other plain text.


Some third-party efforts have been made to create alternative URIs to
identify media types:

http://purl.org/NET/mediatypes/text/plain

(sadly the above does not content-negotiate to RDF either)

Those are achieved by screen scraping of iana,org, and is also
available as a (probably by now outdated) RDF dump [5].



Example of usage of these identifier beyond the Linked Data world,
from the COMBINE Archive [6], page 7:

> The Internet Media type URI should be of the form
> http://purl.org/NET/mediatypes/
> followed by
> the Media type name. Here are a few examples:
>
> for png (Portable Network Graphics) http://purl.org/NET/mediatypes/image/png
> for pdf (Portable Document Format) http://purl.org/NET/mediatypes/application/pdf

(There the use of URIs allow one to mix and match official IANA media
types with other format identifiers)





How to describe mediatypes in RDF is still a bit unclear, it is
however common to use Dublin Core's "format" property, although there
are two approaches:

  <http://example.com/image.png> dc:format "text/html"

No links to follow here.. and dc:format also allows any kind of format
literals like "VHS" and "Polaroid". Easy to use, though.


The dcterms:format property [7] is more suited as it goes to a
resource, but is a bit cumbersome to describe correctly as an IANA
type, see [4]





I would want to propose for IANA to support this directly for all
registered types without needing the volunteer effort of
http://purl.org/NET/mediatypes/


<http://example.com/page.html> dcterms:format
<http://www.iana.org/assignments/media-types/application/pdf> .


Resolving http://www.iana.org/assignments/media-types/application/pdf
with content negotiation to Accept: text/turtle should give something like:

@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix dcterms: <http://purl.org/dc/terms/> .
@prefix dcam: <http://purl.org/dc/dcam/> .

<http://www.iana.org/assignments/media-types/application/pdf> a
dcterms:FileFormat ;
  owl:sameAs <http://purl.org/NET/mediatypes/application/pdf> ;
  dcam:memberOf dcterms:IMT ;
  rdfs:label "application/pdf" ;
  rdf:value "application/pdf" ;
  rdfs:isDefinedBy <http://www.iana.org/assignments/media-types> ;
  rdfs:seeAlso <http://www.iana.org/assignments/media-types/application/pdf.txt>


Resolving <http://www.iana.org/assignments/media-types> for
Accept: text/turtle should give the concatenation (without repeated @prefix)
of all those.

Support for other RDF formats like application/rdf and
application/ld+json should also be considered.


Is this something for IANA to consider?



As a minimum what I am hoping for is a predictable URI pattern that we
can use for any registered IANA type - e.g. so that
http://www.iana.org/assignments/media-types/text/plain and friends
don't give a 404.




[1] http://www.w3.org/DesignIssues/LinkedData.html
[2] http://www.w3.org/standards/semanticweb/data
[3] http://www.w3.org/TR/cooluris/
[4] https://gist.github.com/stain/4635250
[5] http://mediatypes.appspot.com/
[6] http://co.mbine.org/system/files/omex.version-1.pdf
[7] http://purl.org/dc/terms/format



-- 
Stian Soiland-Reyes, myGrid team
School of Computer Science
The University of Manchester
http://soiland-reyes.com/stian/work/ http://orcid.org/0000-0001-9842-9718