Skip to Content.
Sympa Menu

inc-ops-notifications - Re: [InCommon NOTICE] InCommon MDQ metadata retrieval issues for some clients

Subject: InCommon Operations Notifications

List archive

Re: [InCommon NOTICE] InCommon MDQ metadata retrieval issues for some clients


Chronological Thread 
  • From: "Nicole Roy" <>
  • To:
  • Subject: Re: [InCommon NOTICE] InCommon MDQ metadata retrieval issues for some clients
  • Date: Fri, 08 Oct 2021 15:40:04 -0600
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=internet2.edu; dmarc=pass action=none header.from=internet2.edu; dkim=pass header.d=internet2.edu; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=3793Io19PN4B9to59yHL3lhgx4+iIJkG03UApSq/OZI=; b=JnRcoJL04yfe/CI827AfX+kJkxe4g3XQGhMz+Q8DM+f5EyRIiYf2I8Tu3HBtcBBZS+cDs5Y3pHuVJeFSI/j3VbangWePCw7hINo3yEwNrrGHB3ht1hnkPX6ROTX5s2Z/tAf3Tdr3IPfLh/b0TxlUKQkNLlpvstJ7bmgP0REOOSQpiDI7WUbytWS18YZ2cJOx69+3ELe/3ymQ+8akhc1tmILVpERdQwvT03ABSsYJWZHxA+m+DQ2cd4STHudh4h68xXdq9P9H1KXeGgUi6QXTmwBV95asDd0RdttoUgODMB+B8ZW7c8jExXEFT+bSdGGBxpeHsvaes+ALpB/0i1H3kw==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=mhU36sYFRUYpew0TCfMbFgIgp3Xr6rkBOO7CF5zOvP6FtFQNUnGOuIy0VXwsHlNPo9r00iY7dt1VO+1y1KF6i41anp0oCYTQHw2/UeIxAL1AIq5tdLeD43Xo+ISGH52ig5cvWoYbBha4y/L9lh52XffBFYQG0pY1YXJh1K1CzVvKwLeCAGXjzUtez8DrGhk8mDQDSzzT0A4IF69GL44SimT1fUsMSbhtiaIglOTGq9x19GExb+5RmmEDk9KR3stWfei6ihehr0Ahh41+kFLpgL9zF4+yLpdtAoSls8mYUzpuLgO1Z1+vq8B/UtDedC8KAFEpgrO6l0d7oP+yni10tw==

AWS has notified us that this issue has been resolved. We are closing the issue within InCommon Operations. We will do an after-action review after we are through the next couple weeks of eduroam-related activities that we are engaged in, and will prepare a report based on that AAR. We will also work with the InCommon Technical Advisory Committee on this.

Best Regards,

Nicole Roy
Pronouns: She/her/hers
Director of Technology and Strategy
InCommon / Internet2 Trust and Identity Services

On 8 Oct 2021, at 14:04, Nicole Roy wrote:

We are seeing signs that some of the CloudFront edge locations that were formerly failing may be starting to return correct MDQ results now. We have not received any confirmation from AWS that this is the case, but we are cautiously optimistic. I’ve updated the outage wiki page, linked below. We will communicate more as we receive more information from AWS.

Thank you,

Nicole

On 8 Oct 2021, at 12:23, Nicole Roy wrote:

Hello,

We continue to pursue this issue with AWS engineers. They have identified a problem with encoding certain characters from a request, which causes them not to be able to correctly resolve an entity descriptor within the S3 origin location that backs our MDQ service. They are working on a fix. As of this update, we do not have an ETA from them, but they assure us that this has the highest internal priority within AWS.

Updates will continue to be posted at: https://spaces.at.internet2.edu/display/federationops/2021-10-07+MDQ+intermittent+outage

Best Regards,

Nicole

On 7 Oct 2021, at 14:34, Nicole Roy wrote:

Another update, we have identified a method of temporarily pinning MDQ requests to a specific, currently-known-good CloudFront edge location. Details at: https://spaces.at.internet2.edu/display/federationops/2021-10-07+MDQ+intermittent+outage

Deployers are advised to test this approach locally, and ensure that whichever CloudFront edge location they pick, actually works before updating hosts files on their affected systems.

Best,

Nicole

On 7 Oct 2021, at 12:35, Nicole Roy wrote:

All future status updates will be posted in the Internet2 wiki, at: https://spaces.at.internet2.edu/display/federationops/2021-10-07+MDQ+intermittent+outage

As of 12:30 MDT, AWS support is able to reproduce this issue and is escalating it within AWS.

Best,

Nicole

On 7 Oct 2021, at 10:06, Nicole Roy wrote:

Update 2020-10-07 10:00 MDT

We have opened a support case with Amazon Web Services with regard to the edge cache locations which appear to be failing:

13.32.208.20
13.32.208.84
13.32.208.58
13.32.208.38

On 7 Oct 2021, at 9:31, Nicole Roy wrote:

As of 9:15 a.m. MDT on October 7, we are able to reproduce the issue and it appears to be isolated to a number of CloudFront servers. We are working to further identify the problem with these distribution points.

Best Regards,

Nicole

On 7 Oct 2021, at 9:06, Nicole Roy wrote:

Hello,

This morning, InCommon operations was notified of some intermittent issues by some metadata clients, in retrieving SAML metadata from https://mdq.incommon.org. We are actively investigating this issue and will share more information when we know more. In the meantime, if you are experiencing issues with the MDQ service, please send email to , including (if possible) detailed information about the client you’re using (SAML software and version), IP address of the client, physical location (approximate), log data if share-able, etc.

Best Regards,

Nicole Roy
Pronouns: She/her/hers
Director of Technology and Strategy
InCommon / Internet2 Trust and Identity Services

Attachment: signature.asc
Description: OpenPGP digital signature




Archive powered by MHonArc 2.6.24.

Top of Page