Skip to Content.
Sympa Menu

per-entity - [Per-Entity] A little MDQ mvp in AWS

Subject: Per-Entity Metadata Working Group

List archive

[Per-Entity] A little MDQ mvp in AWS


Chronological Thread 
  • From: Christopher Hubing <>
  • To: Per-Entity Metadata Working Group <>
  • Subject: [Per-Entity] A little MDQ mvp in AWS
  • Date: Thu, 13 Oct 2016 16:02:15 +0000
  • Authentication-results: spf=none (sender IP is ) ;
  • Ironport-phdr: 9a23:Cqsj9xTaV9iYg9i39rF4aWqfjtpsv+yvbD5Q0YIujvd0So/mwa64ZxSN2/xhgRfzUJnB7Loc0qyN7PCmBDdLuMvJmUtBWaIPfidNsd8RkQ0kDZzNImzAB9muURYHGt9fXkRu5XCxPBsdMs//Y1rPvi/6tmZKSV3XfDB4LeXtG4PUk9//l6Xro8WSME10g2+FbKk3AROqrBnA/uwbnYJuI7o4giTOuHhJfese6nlvOErbyw7x/IK9+oJi7zV4uvQq8MtFVqO8eL43G+92FjMjZks1/szwqR7bTQaVrlsBWGQQnVIcCAjY5hD8Ub/wtDf3rOxwxHPcMMHrG+NnEQ++5rtmHUe7wBwMMCQ0pSSO0pR9
  • Spamdiagnosticmetadata: NSPM
  • Spamdiagnosticoutput: 1:99


I threw together a little MDQ rapid prototype in AWS, using all AWS serverless features.

First off, I grabbed the InCommon metadata and dumped it into a DynamoDB table (with Java). I used API Gateway with CloudFront wrapped around it for CDN. The API Gateway talks to a Lambda function (nodejs atm) to retrieve the MD and return it to the client.

So, for example, you could hit one of the URLs below and be returned metadata for that entityid (or insert one of your own entityIDs). It currently uses a self-signed cert, so ignore any SSL warnings.

https://mdqtest.testbed.tier.internet2.edu/entities/urn:mace:incommon:uiowa.edu
https://mdqtest.testbed.tier.internet2.edu/entities/urn:mace:incommon:psu.edu
https://mdqtest.testbed.tier.internet2.edu/entities/urn:mace:incommon:osu.edu
https://mdqtest.testbed.tier.internet2.edu/entities/https://uwdcc.org/shibboleth

The caching on the CDN side defaults to 300 seconds, but can be tuned anywhere from 0 (no caching) to 3600 seconds.

I wanted to demonstrate that is possible to create a massively scalable app in a short period of time with little infrastructure to manage. The metadata isn't signed, but that could be done rather easily I would think.

The cost? For sustained 1000 requests/second, it would be about $100/month. I'm not sure what the current stats are for the InC metadata requests, I need to check out Splunk. But something like this could be complementary to the current aggregate hosting.

Question/Comments/Thoughts?


-chris



mdqtest.testbed.tier.internet2.edu is an alias for d24u4rzbem9xup.cloudfront.net.

d24u4rzbem9xup.cloudfront.net has address 52.84.76.67
d24u4rzbem9xup.cloudfront.net has address 52.84.76.122
d24u4rzbem9xup.cloudfront.net has address 52.84.76.174
d24u4rzbem9xup.cloudfront.net has address 52.84.76.123
d24u4rzbem9xup.cloudfront.net has address 52.84.76.72
d24u4rzbem9xup.cloudfront.net has address 52.84.76.179
d24u4rzbem9xup.cloudfront.net has address 52.84.76.241
d24u4rzbem9xup.cloudfront.net has address 52.84.76.253



Archive powered by MHonArc 2.6.19.

Top of Page