Phil 7.20.16

7:00 – 4:30 VTX

  • MVA Bike renew today!
  • More thesis proposal cleanup. Meeting with Don Thursday at 430, Wayne 4:30 on Friday
  • Finishing EIPGateway task
    • Deleted Controllers – works
    • Deleted services – works
    • Checkin – done
    • Verify on CI
    • Verify on QA?
    • Close out task.

Phil 7.19.16

7:00 – 5:30 VTX

  • Refining proposal
  • Working the Gateway exposes old APIs
  • gtc-services cmd: java -Dspring.profiles.active=swagger -jar build\libs\gtc-server.war
  • ServiceRegistrar command: java -jar build\libs\eureka.war
  • Important URLs
  • Test login: user/N!r@lR1t
  • Got the annotations commented out and the services not showing up. Tomorrow I’ll pull out the
  • Here’s the thread with Balaji:[‎7/‎19/‎2016 11:00 AM]Hiya, I hope you’re doing well.

    Anyway, I’ve been tasked with stripping out services from the gtc-gateway

    [‎7/‎19/‎2016 11:00 AM] Balaji Patakula:

    yeah, i am good thanks.

    [‎7/‎19/‎2016 11:01 AM]

    I’ve got it downloaded, built and running. How do I know that it’s actually working?

    [‎7/‎19/‎2016 11:02 AM] Balaji Patakula:

    run it as java -jar build/libs/gtc-serv…war

    and whichever service u r testing….should be run too

    [‎7/‎19/‎2016 11:03 AM]

    already built. Running as java -jar gtc-service.war

    [‎7/‎19/‎2016 11:03 AM] Balaji Patakula:

    run serviceregistrar too

    [‎7/‎19/‎2016 11:03 AM]

    I don’t have that war file

    Aaron had me set the <serviceRegistry> to ‘none’.

    [‎7/‎19/‎2016 11:04 AM] Balaji Patakula:

    u should get the project ServiceRegistrar

    no, u have to turn it on

    [‎7/‎19/‎2016 11:04 AM]

    Ok. Start in which order?

    [‎7/‎19/‎2016 11:04 AM] Balaji Patakula:

    to test the intercation

    order does not matter. U understadn the interactions of the gateway architecture, correct?

    [‎7/‎19/‎2016 11:05 AM]

    No. I am totally new to this.

    [‎7/‎19/‎2016 11:06 AM] Balaji Patakula:

    mmm….let me see if i can tell u in brief.

    [‎7/‎19/‎2016 11:06 AM]

    Where is the ServiceRegistrar project?

    [‎7/‎19/‎2016 11:07 AM] Balaji Patakula:

    service registrar is the registry of all the services. http://dev1.netcentric.local/svn/CommonComponents/ServiceRegistrar/trunk

    [‎7/‎19/‎2016 11:07 AM]

    Downloading now

    [‎7/‎19/‎2016 11:08 AM] Balaji Patakula:

    so every microservice on startup registers with the service registrar. thats why u have the serviceregistry configuration on all services and that should be set to original localhost:8761….

    [‎7/‎19/‎2016 11:08 AM]

    Ok.

    [‎7/‎19/‎2016 11:08 AM] Balaji Patakula:

    now all server requests from proxy are routed to gateway

    [‎7/‎19/‎2016 11:09 AM]

    Here are the services that I need to remove:

    [‎7/‎19/‎2016 11:09 AM]

    • com.vistronix.gtc.service.alert
    • com.vistronix.gtc.service.data
    • com.vistronix.gtc.service.routing
    • com.vistronix.gtc.service.simulator
    • com.vistronix.gtc.service.stock
    • com.vistronix.gtc.service.subscription

    how do I ping them

    [‎7/‎19/‎2016 11:09 AM] Balaji Patakula:

    gateway queries the service registar….finds the services by name and load balances the requests to appropriate service

    so thats the architecture

    [‎7/‎19/‎2016 11:10 AM]

    Got it, in general terms. Though I’ll need to know service names obviously

    [‎7/‎19/‎2016 11:11 AM] Balaji Patakula:

    so u need the service that u r testing, registrar and gateway running and make the request to the gateway and the request will be handled by microservice

    [‎7/‎19/‎2016 11:11 AM]

    So I need to download and build all of the projects that we are *not* using?
    [‎7/‎19/‎2016 11:12 AM] Balaji Patakula:

    no, i just explained how u would test a service through gateway

    the best way to remove them would be….go to com.vistronix.gtc.web

    that package and comment @Controller nnotation on the services that u mentioned above

    [‎7/‎19/‎2016 11:13 AM] Balaji Patakula:

    and u r done

    [‎7/‎19/‎2016 11:13 AM]

    So back to the basic questions that I asked at the beginning:

    [‎7/‎19/‎2016 11:13 AM] Balaji Patakula:

    so, once u comment those controllers, they are not exposed through gateway anymore

    [‎7/‎19/‎2016 11:14 AM]

    1) How do I know that the gateway service is running?

    2) How do I know that the services that I want to remove are accessable.

    3) How do I know that they are no longer accessable?

    [‎7/‎19/‎2016 11:15 AM]

    Is there a set of urls that I access?

    [‎7/‎19/‎2016 11:15 AM] Balaji Patakula:

    yes, they are in the swagger UI on the gateway

    [‎7/‎19/‎2016 11:16 AM]

    That means slightly more than nothing to me, in that I recognise the words.

    How do I compile, run and access said swagger UI?

    [‎7/‎19/‎2016 11:17 AM]

    I have been working on getting Google searches to improve. Not this.

    [‎7/‎19/‎2016 11:18 AM] Balaji Patakula:

    i am trying to find the doc in confluence

    [‎7/‎19/‎2016 11:18 AM]

    Great! Thanks.

    [‎7/‎19/‎2016 11:19 AM] Balaji Patakula:

    http://dockerapps.eip.nj.vistronix.com/gtc-server/swagger-ui.html

    https://visconfnetc.vistronix.com:8443/display/SPD/ApplicationGateway

    [‎7/‎19/‎2016 11:20 AM]

    what’s the login?

    [‎7/‎19/‎2016 11:20 AM] Balaji Patakula:

    basically thats the swagger UI on gateway

    same as other….user/N!r@lR1t

    [‎7/‎19/‎2016 11:21 AM] Balaji Patakula:

    so when u comment those controllers that i told u, above u should not see them here

    [‎7/‎19/‎2016 11:22 AM]

    So when I rebuild the gateway, I should see these pages disappear?

    [‎7/‎19/‎2016 11:23 AM] Balaji Patakula:

    the controller that u removed should disappear

    [‎7/‎19/‎2016 11:23 AM]

    Very good. I’m in the process of building the ServiceRegistrar now. I’ll ping you with questions later. Thanks!

    [‎7/‎19/‎2016 11:24 AM] Balaji Patakula:

    ok

    [‎7/‎19/‎2016 11:26 AM]

    So I should test by accessing http://localhost:8800/gtc-server/swagger-ui.html

    [‎7/‎19/‎2016 11:27 AM] Balaji Patakula:

    yes

    [‎7/‎19/‎2016 11:27 AM]

    And the ServiceRegistrar jarfile is eureka.war?

    [‎7/‎19/‎2016 11:27 AM] Balaji Patakula:

    yes

    u dont need eureka unless u want to test the calls

    [‎7/‎19/‎2016 11:28 AM]

    But if it’s not showing up in swagger then I can safely assume thatt it’s gone?

    [‎7/‎19/‎2016 11:28 AM] Balaji Patakula:

    yes

    [‎7/‎19/‎2016 11:28 AM]

    Ok.

    [‎7/‎19/‎2016 11:30 AM] Balaji Patakula:

    my take is just comment the @Controller. so that we can bring them back when we need them

    but check with Aaron/Matt if they want u to remove the whole code

    [‎7/‎19/‎2016 11:30 AM]

    Sounds good to me. Fireing up everything now just to see if the swagger works.

    This site can’t be reached

    localhost refused to connect.

    [‎7/‎19/‎2016 11:32 AM]

    No errors other than this on Eureka: 2016-07-19 11:29:28,785 ERROR [main] com.netflix.discovery.DiscoveryClient DiscoveryClient_EUREKA/PFELDMAN-NCS

    [‎7/‎19/‎2016 11:33 AM] Balaji Patakula:

    can u do localhost:8070/gtc-server/user

    [‎7/‎19/‎2016 11:34 AM]

    {“firstName”:”Anonymous”,”lastName”:”Anonymous”,”email”:”anonymous@example.com”,”subject”:”user”,”roles”:”eip_admin”}

    [‎7/‎19/‎2016 11:35 AM] Balaji Patakula:

    ok, so…problem looks like 8800 ?? on swagger?

    [‎7/‎19/‎2016 11:35 AM]

    Could be – that’s what’s in the config file. Can I change it to something less common?

    [‎7/‎19/‎2016 11:36 AM] Balaji Patakula:

    check with Matt, because he is working on disabling the swagger in QA and Prod env

    may be he changed something that takes it away from local too???

    [‎7/‎19/‎2016 11:36 AM]

    No, I mean for testing.

    On my machine

    Oh, I see – you think it may already be disabled.

    [‎7/‎19/‎2016 11:37 AM] Balaji Patakula:

    yes

    he mentioned in the standup that he disabled it

    and only enabled it on CI

    may be disabled for local too

    [‎7/‎19/‎2016 11:38 AM]

    Yeah, I heard that.

    [‎7/‎19/‎2016 11:38 AM] Balaji Patakula:

    check with him, but u can also try with one controller on CI

    and see the change

    [‎7/‎19/‎2016 11:39 AM]

    This is CI, right? http://dockerapps.eip.nj.vistronix.com/gtc-server/swagger-ui.html

    [‎7/‎19/‎2016 11:40 AM] Balaji Patakula:

    yes

    [‎7/‎19/‎2016 11:40 AM]

    So back to my question. How do I test on my local machine?

    [‎7/‎19/‎2016 11:41 AM] Balaji Patakula:

    that swagger UI should be working locally and u can test with that

    so, ask Matt to see why it is not working locally

    [‎7/‎19/‎2016 11:42 AM]

    Alright, I’ll check with him.

  • And here’s the thread with Matt:[‎7/‎19/‎2016 11:55 AM]So I’m working on the gtc task and was going to use swagger as a proxy for detecting if the services were disabled.

    The thing is, swagger works on CI, but not on my box.  Is it still enabled?

    [‎7/‎19/‎2016 11:59 AM]

    Gotta step out for a bit.

    [‎7/‎19/‎2016 12:03 PM] Martin, Matthew:

    so T and Vinny stopped by.  Yesterday I delivered a change to the proxy and gateway that diables swagger by default.  i just updated the wiki page on how to enable it when running the microservice locally

    https://visconfnetc.vistronix.com:8443/display/SPD/REST+API+Documentation

    [‎7/‎19/‎2016 12:04 PM]

    I’ll take a look.

    [‎7/‎19/‎2016 12:04 PM] Martin, Matthew:

    its basically this: java -Dspring.profiles.active=swagger -jar build\libs\microservice.war

    [‎7/‎19/‎2016 12:06 PM]

    I just need to do this for gtc-server.war, right?

    [‎7/‎19/‎2016 12:14 PM]

    Ok, so I’m trying http://localhost:8800/gtc-server/swagger-ui.html and getting This site can’t be reached

    localhost refused to connect.

    We saved this conversation. You’ll see it soon in the Conversations tab in Skype for Business and in the Conversation History folder in Outlook.

    [‎7/‎19/‎2016 2:04 PM] Martin, Matthew:

    here’s how I run the gateway microservice locally:

    C:\fusion\GTC\EIPGateway

    java -jar -Dspring.profiles.active=swagger build\libs\gtc-server.war

    here’s how i access swagger:

    http://localhost:8070/gtc-server/swagger-ui.html

    [‎7/‎19/‎2016 2:04 PM]

    I think I may have had an older version of the code. downloading and rebuilding now. We’ll see how that goes.

    :8070? Where is that configured?

    [‎7/‎19/‎2016 2:06 PM] Martin, Matthew:

    \src\main\resources\application.yml

    also defaults are listed here: https://visconfnetc.vistronix.com:8443/display/SPD/MicroServices

    [‎7/‎19/‎2016 2:13 PM]

    Got it!

    [‎7/‎19/‎2016 2:14 PM] Martin, Matthew:

    (*)

    We saved this conversation. You’ll see it soon in the Conversations tab in Skype for Business and in the Conversation History folder in Outlook.

    [‎7/‎19/‎2016 3:08 PM]

    Based on Balaji’s suggestion, I just commented out the @Controller annotation for the specified services. The services are no longer in the swagger UI. I’ve promoted to QA.

    Since I don’t have permissions on the Jira to adjust status…

    [‎7/‎19/‎2016 3:09 PM] Martin, Matthew:

    Now that you’ve confirmed that it works please delete the associated code/files.

    [‎7/‎19/‎2016 3:10 PM]

    Balaji suggested that it might be a good idea to keep them. So you’d prefer that I delete the files instead?

    [‎7/‎19/‎2016 3:11 PM] Martin, Matthew:

    yes.  i think it was a good idea to test with comments, but I’d like to have it deleted in the end

    [‎7/‎19/‎2016 3:11 PM]

    Ok. Will get started on that.

    [‎7/‎19/‎2016 3:12 PM] Martin, Matthew:

    ok thanks Phil.

    [‎7/‎19/‎2016 3:13 PM]

    Just to be sure, that’s the code in the web and service directories (e.g. AlertServiceController and AlertServiceClient)

    [‎7/‎19/‎2016 3:15 PM] Martin, Matthew:

    yes

     

Phil 7.18.16

7:00 – 3:30 VTX

  • Writing and reworking Lit Review 2. After that, I need to rework the research plan so that RQs and Hs are interchanged.
  • Meeting with Ned Thursday evening?
  • Meeting with Thom second week of August.
  • If there is time today, try to add color change to the table cells to reflect rank. Failing that, add a column that shows relative motion? Both?
    • Added a Rank and Delta field. That seems to be working fine.
  • Finished lockout task
  • Starting Gateway exposes old APIs task

Phil 7.15.16

9:00 – 5:00 VTX

  • Spent most of the morning figuring where and how to run the login lockout tests. Finally ran everything successfully and wrote up a test script.
  • Added a slider to CorpusManager
  • Updated IntelliJ and tested

Phil 7.14.16

8:30 – 5:00

  • Great vacation – Added simulation to the list of Things to do…
  • Much restructuring on the project. I’m going to do security for a couple of weeks?
  • Waiting for testing accounts and confluence pages for security?
    • I think it’s this: https://visconfnetc.vistronix.com:8443/display/EP/EIP+V1.0+Security
  • Meeting with Aaron Massey – reviewed progress. He may have some data that looks at the intersection between requirements and regulation.

Phil 6.24.16

7:00 – 3:30 VTX

  • Thinking about Predictwise’s ostensible failure to get the Brexit right. (Brexit – PredictWise-6.24.2016). The comments are also hugely interesting. Added it to the corpus and coded it lightly. Don’t want to lose it. From the Predictwise blog:
    • Second, market did not pick up on enough idiosyncratic data in the field. Maybe this is because traders do not have the pulse of working masses? Possible. But, I go back to the first point as probably more important. This morning I have a new line of research that I am obsessed with: political impact on financial markets is under-explored and fascinating. I suspect that we underestimated the effect that the volatility of the underlying currency had on the prediction and financial markets.
  • Back to creating friction and refining that part of the contribution. I need to find that article that does fact-checking by looking at Wikipedia hops. Found it: Computational Fact Checking from Knowledge Networks
  • Writing report. Done!

Phil 6.22.16

6:45 – 4:45 VTX

  • Running analytics on the CSCW corpora
  • My codebase at home was out of data, and I was having my missing bouncycastle jar file issue, so I updated the development folders. I also started updating my IntelliJ, which has taken 10 minutes so far…
  • First pass of running the CSCW data through the tool.
    There are three categories:
    • CSCW17 – these are the submitted papers
    • MostCited – These are (generally) the most cited paper by the author where they are first or last author. It took me a while to start doing this, so the set isn’t consistent.
    • MostRecent – These are the most recent papers that I could get copies of. Same constraints and caveats as above.
    I also deleted the term ‘participants’, as it overwhelmed the rest of the relationships and is a pretty standard methods element that I don’t think contributes to the story the data tell.
    Here’s the top ten items, ranked by influence of terms inside of the top 52 items in the LSI ranking. It’s kind of interesting…
    CSCW2017 Most Cited Most Recent Most Cited Most Recent
    older social media Sean P. Goggins.pdf Donald McMillan.pdf
    ageism student privacy Chinmay Kulkarni.pdf Mark Rouncefield.pdf
    adult photo twitter Airi Lampinen.pdf Sarah Vieweg.pdf
    blogger awareness behavior Cliff Lampe.pdf Jeffrey T. Hancock.pdf
    ageist object device Anne Marie Piper.pdf David Randall.pdf
    platform class interview Frank Bentley.pdf Cliff Lampe.pdf
    workplace facebook notification Mor Naaman.pdf Sean P. Goggins.pdf
    woman friend deception Morgan G. Ames.pdf Airi Lampinen.pdf
    gender flickr phone Gabriela Avram.pdf Wayne Lutters.pdf
    snapchat software facebook Lior Zalmanson.pdf Vivek K. Singh.pdf
  • Finished rating! 530 pages. Now I need to get the outputs to Excel. I think the view_ratings should be enough…?
  • I don’t have just names alone, but I’m going to assume that the initial set of queries (‘board actions’, ‘criminal’, ‘malpractice’ and ‘sanctions’) may modestly improve the search. So with a proxy for the current system, with my small data set, I have the following results:
    • Hits or near misses – 46 pages or 16.7% of the total pages evaluated
    • Misses – 230 or 83.3% of the total pages evaluated

    With the new CSE configuration (exactTerms=<name permutation>, query=<full state name>, orTerms=<TF-IDF string>, we get much better results:

    • Hits or near misses – 252 pages or 78% of the total pages evaluated
    • Misses – 71 or 22% of the total pages evaluated

    So it looks like we can expect something on the order of a 450% improvement in results.

  • Good presentation on document similarity

Phil 6.21.16

7:00 – 5:00 VTX

  • Finished MostRecent.
  • Checked Data directory into SVN
  • Testing rating algorithms. Seems to be working pretty well 🙂
  • Rated all day. Should finish tomorrow.
  • Worked through paragon and fallen angel patterns with Aaron. Pulled out by bayesian spreadsheets and realized I no longer understood them…

Phil 6.20.16

7:00 – 7:00 VTX

  • Building chair corpus = Current and Cited
  • Filled MostCited.
  • Rating a few more pages. Still not getting any name hits.
  • Going to advanced search and entering items into each field, I get a different looking query:
    https://www.google.ca/search?as_q=New+York&as_epq=Nader+Golian&as_oq=+license+board+practice+patient+physician+order+health+practitioner+medicine+medical
    • These seem to be the important differences
    • as_q=New+York — This is a ‘normal’ query
    • as_epq=Nader+Golian — This must be in the results
    • as_oq=+license+board+practice+patient+physician+order+health+practitioner+medicine+medical — at least one of these must be in the result
  • Going to add a test to look for the name in the query (and the state?) and at least check the NA box and throw up a dialog. Could also list the number of occurrences by default in the notes

1:00 – Patrick’s proposal

  • Framing of problem and researcher
  • Overview of the problem space
    • Ready to Hand
    • Extension of self
  • Assistive technology abandonment
    • Ease of Acquisition
    • Device Performance
    • Cost and Maintenance
    • Stigma
    • Alignment with lifestyles
  • Prior Work
    • Technology Use
    • Methods Overview
      • Formative User Needs
      • Design Focus Groups
      • Design Evaluation and Configuration Interviews
    • Summary of Findings
    • Priorities
      • Maintain form factor
      • Different controls for different regions
      • Familiarity
      • Robustness to environmental changes
    • Potential of the wheelchair
      • Nice diagram. Shows the mapping from a chair to a smartphone
    • Inputs to wheelchair-mounted devices
    • Force sensitive device, new gestures and insights
    • Summary (This looks like research through design. Why no mention?)
      • Prototypes
      • Gestures
      • Demonstration
  • Proposed Work
    • Passive Haptic Rehabilitation
      • Can it be done
      • How effective
      • User perception
      • Study design!!!
    • Physical Activity and Athletic Performance
      • Completed: Accessibility of fitness trackers. (None of this actually tracks to papers in the presentation)
      • Body location and sensing
      • Misperception
        • Semi-structured interviews
        • Low experience / High interest (Lack of system trust!)
    • Chairable Computing for Basketball
      • Research Methods
        • Observations
        • Semi-structured interviews
        • Prototyping
        • Data presentation – how does one decide what they want from what is available?
  • What is the problem – Helena
    • Assistive technologies are not being designed right. We need to improve the design process.
    • That’s too general – give me a citation that says that technology abandonment WRT wheelchair use has high abandonment
    • Patrick responds with a bad design
    • Helena – isn’t the principal user-centered design. How has the HCI community done this before WRT other areas than wheelchairs to interact with computing systems
    • Helena – Embodied interaction is not a new thing, this is just a new area.Why didn’t you group your work. Is the prior analysis not embodied? Is your prior work not aligned with this perspective
  • How were the design principles used o develop an refine the pressure sensors?

More Reading

  • Creating Friction: Infrastructuring Civic Engagement in Everyday Life
    • This is the confirming information bubble of the ‘ten blue links’: Because infrastructures reflect the standardization of practices, the social work they do is also political: “a number of significant political, ethical and social choices have without doubt been folded into its development” ([67]: 233). The further one is removed from the institutions of standardization, the more drastically one experiences the values embedded into infrastructure—a concept Bowker and Star term ‘torque’ [9]. More powerful actors are not as likely to experience torque as their values more often align with those embodied in the infrastructure. Infrastructures of civic engagement that are designed and maintained by those in power, then, tend to reflect the values and biases held by those in power.
  • Meeting with Wayne. My hypothesis and research questions are backwards but otherwise good.

Phil 6.17.16

8:00 – 3:00 VTX

  • Finishing up the chairs. And then I have some more coding to do on the new papers.
  • Anatomy of the Unsought Finding. Serendipity: Origin, history, domains, traditions, appearances, patterns and programmability (stored here)
  • Supporting serendipity: Using ambient intelligence to augment user exploration for data mining and web browsing (stored here)
  • Starting to create new json file of practitioners
    {
    	"first_name": "Nader",
    	"last_name": "Golian",
    	"pp_state": "New York",
    	"gender": "male",
    }, {
    	"first_name": "Ata",
    	"middle_name": "Ollah",
    	"last_name": "Mehrtash",
    	"pp_state": "New York",
    	"gender": "male",
    }, {
    	"first_name": "Souheil",
    	"last_name": "Saba",
    	"pp_state": "New Jersey",
    	"gender": "male",
    }, {
    	"first_name": "Kamal",
    	"last_name": "Patel",
    	"pp_state": "Illinois",
    	"gender": "male",
    }
  • Built new queries
    sst = new SmartSearchTerm("schedule guideline license substance board sentence increase other prescription commission", null, null);
            termList.add(sst);
            sst = new SmartSearchTerm("schedule criminal license sentence prescription doctor defendant board practice research county", null, null);
            termList.add(sst);
            sst = new SmartSearchTerm("license board practice patient physician order health practitioner medicine medical", null, null);
            termList.add(sst);
            sst = new SmartSearchTerm("physician license professional return number effective", null, null);
            termList.add(sst);
            sst = new SmartSearchTerm("respondent consent committee probation agreement pursuant", null, null);
            termList.add(sst);
  • Hmm. Getting good pages that are like this, but no matches on the names yet. Tempted to have an option that rejects pages that don’t have an entity with the right name in it and let it cook.
  • Helped write up ML text for SOW.

Phil 6.16.16

7:30 – 5:30 VTX

  • Still working through chairs. It’s been a really good exercise. I’m finding some very good stuff.
  • Worked through equations that would help find paragons, oscillators, cyphers and criminals.
  • Committed all the changes that I forgot about yesterday.
  • “name” <state> license board practice patient physician order health practitioner medicine medical seems to work well.
  • Building auto-trimmed matrix based on 50% of rank value. Done!

Phil 6.15.16

7:00 – 10:00, 12:00 – 4:00 VTX

  • Got the official word that I should be charging the project for research. Saved the email this time.
  • Continuing to work on the papers list
  • And in the process of looking at Daniele Quercia‘s work, I found Auralist: introducing serendipity into music recommendation which was cited by
    An investigation on the serendipity problem in recommender systems. Which has the following introduction:

    • In the book ‘‘The Filter Bubble: What the Internet Is Hiding from You’’, Eli Pariser argues that Internet is limiting our horizons (Parisier, 2011). He worries that personalized filters, such as Google search or Facebook delivery of news from our friends, create individual universes of information for each of us, in which we are fed only with information we are familiar with and that confirms our beliefs. These filters are opaque, that is to say, we do not know what is being hidden from us, and may be dangerous because they threaten to deprive us from serendipitous encounters that spark creativity, innovation, and the democratic exchange of ideas. Similar observations have been previously made by Gori and Witten (2005) and extensively developed in their book ‘‘Web Dragons, Inside the Myths of Search Engine Technology’’ (Witten, Gori, & Numerico, 2006), where the metaphor of search engines as modern dragons or gatekeepers of a treasure is justified by the fact that ‘‘the immense treasure they guard is society’s repository of knowledge’’ and all of us accept dragons as mediators when having access to that treasure. But most of us do not know how those dragons work, and all of us (probably the search engines’ creators, either) are not able to explain the reason why a specific web page ranked first when we issued a query. This gives rise to the so called bubble of Web visibility, where people who want to promote visibility of a Web site fight against heuristics adopted by most popular search engines, whose details and biases are closely guarded trade secrets.
    • Added both papers to the corpus. Need to read and code. What I’m doing is different in that I want to add a level of interactivity to the serendipity display that looks for user patterns in how they react to the presented serendipity and incorporate that pattern into a trustworthiness evaluation of the web content. I’m also doing it in Journalism, which is a bit different in its constraints. And I’m trying to tie it back to Group Polarization and opinion drift.
  • Also, Raz Schwartx at Facebook: , Editorial Algorithms: Using Social Media to Discover and Report Local News
  • Working on getting all html and pdf files in one matrix
  • Spent the day chasing down a bug where if the string being annotated is too long (I’ve set the  number of wordes to 60), then we skip. THis leads to a divide by zero issue. Fixed now