RFC7073: A Reputation Response Set for Email Identifiers

Download in PDF format Download in text format






Internet Engineering Task Force (IETF)                     N. Borenstein
Request for Comments: 7073                                      Mimecast
Category: Standards Track                                   M. Kucherawy
ISSN: 2070-1721                                            November 2013


            A Reputation Response Set for Email Identifiers

Abstract

   This document defines a response set for describing assertions a
   reputation service provider can make about email identifiers, for use
   in generating reputons.

Status of This Memo

   This is an Internet Standards Track document.

   This document is a product of the Internet Engineering Task Force
   (IETF).  It represents the consensus of the IETF community.  It has
   received public review and has been approved for publication by the
   Internet Engineering Steering Group (IESG).  Further information on
   Internet Standards is available in Section 2 of RFC 5741.

   Information about the current status of this document, any errata,
   and how to provide feedback on it may be obtained at
   http://www.rfc-editor.org/info/rfc7073.

Copyright Notice

   Copyright (c) 2013 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (http://trustee.ietf.org/license-info) in effect on the date of
   publication of this document.  Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document.  Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of
   the Trust Legal Provisions and are provided without warranty as
   described in the Simplified BSD License.









Borenstein & Kucherawy       Standards Track                    [Page 1]

RFC 7073             Email Identifiers Response Set        November 2013


Table of Contents

   1. Introduction ....................................................2
   2. Terminology and Definitions .....................................2
      2.1. Key Words ..................................................2
      2.2. Email Definitions ..........................................2
      2.3. Other Definitions ..........................................3
   3. Discussion ......................................................3
      3.1. Assertions .................................................3
      3.2. Response Set Extensions ....................................4
      3.3. Identifiers ................................................4
      3.4. Query Extensions ...........................................5
   4. IANA Considerations .............................................5
      4.1. Registration of 'email-id' Reputation Application ..........5
   5. Security Considerations .........................................6
   6. References ......................................................7
      6.1. Normative References .......................................7
      6.2. Informative References .....................................7
   Appendix A. Positive vs. Negative Assertions .......................8
   Appendix B. Acknowledgments ........................................8

1.  Introduction

   This document specifies a response set for describing the reputation
   of an email identifier.  A "response set" in this context is defined
   in [RFC7070] and is used to describe assertions a reputation service
   provider can make about email identifiers as well as metadata that
   can be included in such a reply beyond the base set specified there.

   An atomic reputation response is called a "reputon", defined in
   [RFC7071].  That document also defines a media type to contain a
   reputon for transport, and creates a registry for reputation
   applications and the interesting parameters of each.

2.  Terminology and Definitions

   This section defines terms used in the rest of the document.

2.1.  Key Words

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in [KEYWORDS].

2.2.  Email Definitions

   Commonly used definitions describing entities in the email
   architecture are defined and discussed in [EMAIL-ARCH].



Borenstein & Kucherawy       Standards Track                    [Page 2]

RFC 7073             Email Identifiers Response Set        November 2013


2.3.  Other Definitions

   Other terms of importance in this document are defined in [RFC7070],
   the base document for the reputation services work.

3.  Discussion

   The expression of reputation about an email identifier requires
   extensions of the base set defined in [RFC7070].  This document
   defines and registers some common assertions about an entity found in
   a piece of [MAIL].

3.1.  Assertions

   The "email-id" reputation application recognizes the following
   assertions:

   abusive:  The subject identifier is associated with sending or
      handling email of a personally abusive, threatening, or otherwise
      harassing nature

   fraud:  The subject identifier is associated with the sending or
      handling of fraudulent email, such as "phishing" (some good
      discussion on this topic can be found in [IODEF-PHISHING])

   invalid-recipients:  The subject identifier is associated with
      delivery attempts to nonexistent recipients

   malware:  The subject identifier is associated with the sending or
      handling of malware via email

   spam:  The subject identifier is associated with the sending or
      handling of unwanted bulk email

   For all assertions, the "rating" scale is linear: a value of 0.0
   means there is no data to support the assertion, a value of 1.0 means
   all accumulated data support the assertion, and the intervening
   values have a linear relationship (i.e., a score of "x" is twice as
   strong of an assertion as a value of "x/2").












Borenstein & Kucherawy       Standards Track                    [Page 3]

RFC 7073             Email Identifiers Response Set        November 2013


3.2.  Response Set Extensions

   The "email-id" reputation application recognizes the following
   OPTIONAL extensions to the basic response set defined in [RFC7071]:

   email-id-identity:  A token indicating the source of the identifier;
      that is, where the subject identifier was found in the message.
      This MUST be one of:

      dkim: The signing domain, i.e., the value of the "d=" tag, found
            on a valid DomainKeys Identified Mail [DKIM] signature in
            the message

      ipv4: The IPv4 address of the client

      ipv6: The IPv6 address of the client

      rfc5321.helo:  The RFC5321.HELO value used by the client (see
            [SMTP])

      rfc5321.mailfrom:  The RFC5321.MailFrom value of the envelope of
            the message (see [SMTP])

      rfc5322.from:  The RFC5322.From field of the message (see [MAIL])

      spf:  The domain name portion of the identifier (RFC5321.MailFrom
            or RFC5321.HELO) verified by [SPF]

   sources:  A token relating a count of the number of sources of data
      that contributed to the reported reputation.  This is in contrast
      to the "sample-size" parameter, which indicates the total number
      of reports across all reporting sources.

   A reply that does not contain the "identity" or "sources" extensions
   is making a non-specific statement about how the reputation returned
   was developed.  A client can use or ignore such a reply at its
   discretion.

3.3.  Identifiers

   In evaluating an email message on the basis of reputation, there can
   be more than one identifier in the message needing to be validated.
   For example, a message may have different email addresses in the
   RFC5321.MailFrom parameter and the RFC5322.From header field.  The
   RFC5321.Helo identifier will obviously be different.  Consequently,
   the software evaluating the email message may need to query for the
   reputation of more than one identifier.




Borenstein & Kucherawy       Standards Track                    [Page 4]

RFC 7073             Email Identifiers Response Set        November 2013


   The purpose of including the identity in the reply is to expose to
   the client the context in which the identifier was extracted from the
   message under evaluation.  In particular, several of the items listed
   are extracted verbatim from the message and have not been subjected
   to any kind of validation, while a domain name present in a valid
   DKIM signature has some more reliable semantics associated with it.
   Computing or using reputation information about unauthenticated
   identifiers has substantially reduced value, but can sometimes be
   useful when combined.  For example, a reply that indicates a message
   contained one of these low-value identifiers with a high "spam"
   rating might not be worthy of notice, but a reply that indicates a
   message contained several of them could be grounds for suspicion.

   A client interested in checking these weaker identifiers would issue
   a query about each of them using the same assertion (e.g., "spam"),
   and then collate the results to determine which ones and how many of
   them came back with ratings indicating content of concern, and take
   action accordingly.  For stronger identifiers, decisions can
   typically be made based on a few or even just one of them.

3.4.  Query Extensions

   A query within this application can include the OPTIONAL query
   parameter "identity" to indicate which specific identity is of
   interest to the query.  Legal values are the same as those listed in
   Section 3.2.

4.  IANA Considerations

   This memo presents one action for IANA, namely the registration of
   the reputation application "email-id".

4.1.  Registration of 'email-id' Reputation Application

   This section registers the "email-id" reputation application, as per
   the IANA Considerations section of [RFC7071].  The registration
   parameters are as follows:

   o  Application symbolic name: email-id

   o  Short description: Evaluates DNS domain names or IP addresses
      found in email identifiers

   o  Defining document: [RFC7073]

   o  Status: current





Borenstein & Kucherawy       Standards Track                    [Page 5]

RFC 7073             Email Identifiers Response Set        November 2013


   o  Subject: A string appropriate to the identifier of interest (see
      Section 3.2 of this document)

   o  Application-specific query parameters:

      identity:  (current) as defined in Section 3.4 of this document

   o  Application-specific assertions:

      abusive:  (current) as defined in Section 3.1 of this document

      fraud:  (current) as defined in Section 3.1 of this document

      invalid-recipients:  (current) as defined in Section 3.1 of this
            document

      malware:  (current) as defined in Section 3.1 of this document

      spam: (current) as defined in Section 3.1 of this document

   o  Application-specific response set extensions:

      identity:  (current) as defined in Section 3.2 of this document

5.  Security Considerations

   This document is primarily an IANA action and doesn't describe any
   protocols or protocol elements that might introduce new security
   concerns.

   Security considerations relevant to email and email authentication
   can be found in most of the documents listed in the References
   sections below.  Information specific to use of reputation services
   can be found in [CONSIDERATIONS].

















Borenstein & Kucherawy       Standards Track                    [Page 6]

RFC 7073             Email Identifiers Response Set        November 2013


6.  References

6.1.  Normative References

   [DKIM]     Crocker, D., Ed., Hansen, T., Ed., and M. Kucherawy, Ed.,
              "DomainKeys Identified Mail (DKIM) Signatures", STD 76,
              RFC 6376, September 2011.

   [EMAIL-ARCH]
              Crocker, D., "Internet Mail Architecture", RFC 5598, July
              2009.

   [KEYWORDS] Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119, March 1997.

   [RFC7070]  Borenstein, N., Kucherawy, M., and A. Sullivan, "An
              Architecture for Reputation Reporting", RFC 7070, November
              2013.

   [RFC7071]  Borenstein, N. and M. Kucherawy, "A Media Type for
              Reputation Interchange", RFC 7071, November 2013.

   [SMTP]     Klensin, J., "Simple Mail Transfer Protocol", RFC 5321,
              October 2008.

   [SPF]      Wong, M. and W. Schlitt, "Sender Policy Framework (SPF)
              for Authorizing Use of Domains in E-Mail, Version 1", RFC
              4408, April 2006.

6.2.  Informative References

   [CONSIDERATIONS]
              Kucherawy, M., "Operational Considerations Regarding
              Reputation Services", Work in Progress, May 2013.

   [IODEF-PHISHING]
              Cain, P. and D. Jevans, "Extensions to the IODEF-Document
              Class for Reporting Phishing", RFC 5901, July 2010.

   [MAIL]     Resnick, P., Ed., "Internet Message Format", RFC 5322,
              October 2008.










Borenstein & Kucherawy       Standards Track                    [Page 7]

RFC 7073             Email Identifiers Response Set        November 2013


Appendix A.  Positive vs. Negative Assertions

   [CONSIDERATIONS] some current theories about reputation, namely that
   it will possibly have more impact to develop positive reputations and
   focus on giving preferential treatment to content or sources that
   earn those.  However, the assertions defined in this document are all
   clearly negative in nature.

   In effect, this document is recording current use of reputation and
   of this framework in particular.  It is expected that, in the future,
   the application being registered here will be augmented, and other
   applications registered, that focus more on positive assertions
   rather than negative ones.

Appendix B.  Acknowledgments

   The authors wish to acknowledge the contributions of the following to
   this specification: Scott Hollenbeck, Scott Kitterman, Peter Koch,
   John Levine, Danny McPherson, S. Moonesamy, Doug Otis, and David F.
   Skoll.

Authors' Addresses

   Nathaniel Borenstein
   Mimecast
   203 Crescent St., Suite 303
   Waltham, MA  02453
   USA

   Phone: +1 781 996 5340
   EMail: nsb@guppylake.com


   Murray S. Kucherawy
   270 Upland Drive
   San Francisco, CA  94127
   USA

   EMail: superuser@gmail.com












Borenstein & Kucherawy       Standards Track                    [Page 8]