Course Description

Most, if not all, of today’s grand challenges (e.g., food, water and energy security) can be described spatially from regional to global scales and, while several individual disciplines contend to address these challenges, there is one key factor that they all have in common: the need for data. Despite our being in an age rich in data, many of the critical datasets needed for our understanding and prediction of our world are, in fact, quite limited.

In this capstone course, you will get the opportunity to utilize your Python programming skills (writing scripts and creating subroutines) to connect to various types of data (e.g., GeoJSON, ASC, HDF5 and NetCDF), synthesize these data to unlock new understanding (using methods such as spatial scaling and gap-filling), create visualizations using open-source GIS software, and present to the world your own story of spatial data discovery professionally written for the web. I will take you through my own journey of data discovery, provide you with the know-how for accessing large data repositories, demonstrate methods for data harmonizing, processing, modeling and visualizing, and challenge you to think spatially.

Catalog Number
  • DATA 431
  • Section 1
  • CRN 14948
Prerequisites
  • CSCI 140/141 or DATA 140/141
Semester
  • Fall 2021 (1 September 2021–21 December 2021)
Location
Class Times
  • MW 1400-1520
Instructors
Office Hours
  • Davis: Mo. & We. 1200–1300; Th. 0930–1030
  • Also by appointment
  • Appointments are welcome outside normally scheduled office hours; please email to set up a time.
  • There may be certain days when office hours will be either canceled or rescheduled; notifications will be sent ahead of time.
Delivery
  • This seminar style class has a labeled modality of FS, which means that it meets predominantly in person (face-to-face) and predominantly synchronous (all meet at the same scheduled place at the same scheduled time).
  • As a seminar, class sizes are held to less than 20 students and are discussion intensive with a variety of typed, coded, and oral assignments.
Final Exam
  • Blackboard-style assessment; opens at 14:00 (2 PM) 21 December 2021
Minimum Passing Grade
  • D
Communication
  • Instant messaging using Slack
    • For casual communication of content or for questions that need quick responses
    • Slack workspace URL: https://spatialdatadiscovery.slack.com
    • You are not required to use Slack; however, this platform will provide you with direct messaging access to the instructor and your peers. If you are interested, please create an account using your W&M email. An invitation link to the workspace will be made available.
  • Discussion posts and issue tracking using GitHub
    • For sharing ideas, methods and content and asking/responding to questions, comments or concerns
    • You must create a GitHub account, if you do not already have one.
  • Video conferencing (Zoom)
    • Video conferencing is for remote class meetings, office hours, video chats, and any other meetings when in-person communication is not possible
  • Email
    • The new snail mail; use this for personal communication or whenever sharing with the class is inappropriate
  • Blackboard (https://blackboard.wm.edu)
    • A one-stop shop for course content, schedule and assessment; links may take you outside of Blackboard (e.g., to GitHub)
  • Organization Website (https://spatial-data-discovery.github.io/)
    • The public-facing organization site is for Spatial Data Discovery class; you will be added to the GitHub Development Team as a Coder and you will have your final project uploaded and displayed here.
    • The content for site is maintained by your instructor, but you will also be granted editing privileges and may be required to upload files to the associated repository (see The Organization Repository)
  • GitHub Team Website (https://github.com/orgs/spatial-data-discovery/teams/f21-developers)
    • Our team website with discussion boards for posting assignments and prompts for class
  • Repositories
  • Videos (YouTube Playlist)
    • Complimentary to course content and used when audio/video is the better communication medium

See Code of Conduct regarding how these communication platforms are (and should be) used.

Textbook
  • Today’s data science textbook is the World Wide Web itself
Course Materials
  • Laptop (required)

    * Your laptop should have at least 500 MB of free disk space, have at least 8 GB of memory, and run a modern desktop OS (e.g,. PC, Mac, or Linux).

    * Note that you are also required to read, write and execute Python code, whether installed natively on your computer or via a cloud computing platform (e.g., JupyterHub or Google Colaboratory)

  • GitHub account (required)

  • W&M Blackboard (required)

  • Slack account (recommended)

Technology
  • Announcements, discussions, course calendar, grades, etc., may be accessed through the university’s Blackboard; please make certain you have access to it.

  • Coding exercises, markdown documents and associated files should be submitted using Git to one of two repositories found on the class’s GitHub Page.

    * You will need to create a GitHub account, if you do not already have one.

    * You are granted write access to both the semester and organization repositories; with power comes responsibility—please use it wisely!

  • Scripting assignments will be tested and graded using Python 3.8; please be sure you have a recent version installed.

  • Web development assignments will be requested in Markdown syntax; your instructor will compile your Markdown using the latest releases of R and pandoc.

    * While you are not required to use R or pandoc, you may install these software packages to test your markdown pages before submitting.

  • Geographic visualizations will be rendered using either the latest stable release or the university’s current installed version of QGIS or Panoply. Please indicate the software versions you install to your professor.

    * All software used in this class are available for all major OSs (e.g., Linux, macOS, and Windows) and are available under free software licenses. QGIS is available in select campus computer labs (check for availability).


Course Objectives

The College 400 capstone experience requires students to take initiative in synthesis and critical analysis, to solve problems in an applied and/or academic setting, to create original material or original scholarship, and to communicate effectively with a diversity of audiences. In this capstone course, students will take on these challenges through the lens of spatial data science.

Students will learn how to access, read, and parse disparate data sources from a variety of open-access international, governmental, and private organizational databases and will learn about the broad challenges associated with spatial data. Students will have an opportunity to tackle these issues using a variety of analytical and statistical techniques. Students will communicate their findings through the web by creating a visualization accompanied by a written summary.

Learning Expectations

  1. To enhance proficiency working with disparate data sources and formats (e.g., EXIF, JSON, CSV, ASC, HDF, NETCDF)
  2. To improve data literacy through data exploration and metadata analysis
  3. To become a more self-sufficient data scientist
  4. To assign meaning to spatial data through spatial analytics
  5. To take initiative in synthesis and critical analysis (COLL 400 LE1)
  6. To become more familiar with topics, challenges, and solutions to real-world data science problems (COLL 400 LE2)
  7. To communicate spatial data using original methods and visualizations (COLL 400 LE3)
  8. To better communicate your science with peers and the general public (COLL 400 LE4)
  9. To develop and express ideas in writing that uses genres, styles, and technologies appropriate to the content and audience and incorporates texts, data, and images. (COLL 400 LE5—Major Writing Requirement)

Never be afraid to show your colleagues your work. If you are afraid to, that’s probably the bigger sign that something is wrong with your code and that you already know it.”

N. Schweitzer (2008), Software Consultant, Wauwatosa, WI.

Student Expectations

  1. Participate in the weekly readings, activities (e.g., sandbox challenges and work journal), and discussions
  2. Attempt weekly quizzes until a score of at least 60% is achieved on each
  3. Attempt the final exam
  4. Submit two website pages:
    • An “about the coder” page (draft and update w/ utility script)
    • A “data discovery” page (draft and final w/ image and/or animation)
  5. Provide constructive feedback (e.g., during class discussions, reviews, and course evaluations)

Topics

The following are the proposed topics to be covered in this class along with some example discussions.

  • Spatial data: what it is and why it is important
  • Form and function: a look at different data file types and why they are used
  • Data access: the API movement and how it impacts us
  • Data discovery: the true purpose of metadata
  • Data analysis: statistical analysis, modeling, and optimization
  • Data visualization: geographic information systems as a tool

Class Structure

This class structure will be twofold:

  1. I will take you on my journey through spatial data discovery from the perspective of one of my active research projects. You will experience real world data and be challenged to think critically and to find analytical and/or programmatic solutions to professional research-grade questions.
  2. You will be given the task to discover data in your own way—making the best use of the resources and methods presented to you—and relay your methods and findings with a web-based presentation that includes a succinct message for a broad audience as well as a meaningful visualization of your data.

Assessment

Anytime you devote to practice, you haven’t lost. You’re a winner.”

– Bob Ross

Course Requirements

This course will consist of the following tasks.

  1. Classroom Discussions [LE #6,8]
  2. Work Journal [LE #3,8,9]
  3. Quizzes and Exam [LE #1,2,4,5]
  4. Discovery Project [LE #7,8,9]

The completion (or attempted completion) of all these tasks will result in at least a passing grade (D). A failure of any one of these tasks will result in a failing grade (F) for the course.

Failure Criteria

The following definitions will be used to determine a failure status for any task listed under the Course Requirements.

Classroom Discussions

Discussions will be held, on average, once a week. A student that misses 50% or more of the discussions (e.g., six or more) will receive a failing grade.

Most thinking involves collaborating with other people. That’s why scientists have lab meetings, why doctors consult with specialists, and why it’s important to have someone to talk to when you’re confused or upset. […] A little social support can generate a lot of confidence.”

– Steven Sloman, Professor of cognitive, linguistic and psychological sciences at Brown University

Work Journal
Students are required to publish an up-to-date work journal throughout the semester. A student that fails to submit even one journal entry by the last day of class will receive a failing grade for the class.
Weekly Quizzes
Weekly Blackboard quizzes are assigned to assist you with the completion of the final exam. Each missed (scoring <60%) or skipped quiz is counted as incomplete; additionally, each quiz completed following a missed or skipped quiz is counted as incomplete (i.e., all quizzes must be completed in order). A student with more than 10 incomplete quizzes will receive a failing grade.
Final Exam
Students are required to submit their responses to the final exam on or before the scheduled exam period (see Course Calendar). A student that does not attempt the final exam will receive a failing grade for the class.
Discovery Project
This class focuses on discovering spatial data, learning how to manipulate it, and visualize it. A student that does not turn in at least a draft of the discovery project will receive a failing grade.

Grading Components

This course is graded on a 100-point scale. The distribution of these points are awarded across the following five tasks. Students may choose either the Final Exam or the Discovery Project as their majority grade (i.e., 75%). Whichever task is chosen for the majority grade, the lower percentage weight will be given to the other task (i.e., 1%). Students are expected choose which task will be their majority grade before taking the final exam.

  1. Final Exam (1% or 75%): Students are asked to complete a singe-submission comprehensive final exam on Blackboard, which is due no later than at the end of the scheduled exam period determined by the university registrar.

  2. Weekly Quizzes (15%): Students are asked to complete the 15 weekly quizzes, which are designed as preparation for the final exam (i.e., quiz 16). Each quiz is a cumulative assessment (it takes into consideration each previous quiz) and is assessed with equal weighting. The due date for all quizzes is the end of final exam period. Each quiz may be attempted multiple times, allowing for practice and/or improvement, where the highest grade from all attempts is recorded. A grade of at least 60% must be achieved for a quiz to be considered completed. All quizzes must be completed in order (no skipping).

  3. Classroom Discussions (5%): Students are asked to engage in weekly discussions, where various topics, methods, theories, approaches and ideas are debated and discussed. In most cases, discussions will take place during class. Each discussion week, a student is expected to act as the scribe, taking notes of the classroom discussion and posting them online for anyone who misses class or has something more to add to the conversation. Participation is defined as engaging in either the classroom or online discussion post, which should be made within 10 working days of the discussion’s original posting. Follow-ups, clarifications, or additions may be submitted at any time. Students that participate in at least 12 discussions (or ten discussions and act as a scribe at least once) receive credit for this task.

  4. Work Journal (4%): Students are asked to keep a semester-long journal of both their in and out of class activities. This type of journal is intended to improve organization and time management skills and is often utilized in industry for the completion of time sheets or billable hours. Students are asked to submit weekly updates of their work journal, which should include at minimum:

    • summary of activities (e.g., bullet list)
    • explanation of activities (more thorough details than summary)
    • calendar or checklist or prioritized list of upcoming tasks

    A student completing at least 12 weekly journal entries receives credit for this task.

  5. Discovery Project (1% or 75%): Students are to create content for a public-facing website that shares their discovery of data using various sources, methods, and visualization techniques. Goals of this project include:

    • Finding new data that excites you
    • Applying any processes/algorithms to the data necessary for your visualization; note that all processes and algorithms must be accompanied by a description of methods sufficient for reproducibility
    • Creating a visualization of the data
    • Sharing your discovery with the general public, explaining where you found your data, why you find it exciting, how you manipulated it for this project

    To complete the project, the following will be assessed:

    • Front-end (markdown script and image file)
    • Back-end (python script, data files, and reproducible process)
    • Web visualization & creativity

    Students will be required to submit parts of their final project by email and to both the Organization Repository and Semester Repository. Information about what materials should be submitted (and how) are given in the README of the project folder in the Semester Repository.

Non-Graded Components

  1. Students are expected to attempt a series of sandbox challenges designed to assist in learning the Git workflow and to receive feedback on various coding practices.
  2. Students are expected to complete in-class activities (e.g., the Spatial Dilemma Challenge and the Sparse Data Challenge), which may be assessed in a discussion post or by an ungraded evaluation.
  3. Students are expected to submit drafts of their website pages (i.e., the About Page and Data Discovery) for ungraded evaluation and feedback.

Grading Scale

The final letter grade is based on the point total of completed Grading Components. The range for each letter grade is given in the following table (three-point ranges for intermediate steps and four-point ranges for whole letter grades). The expectations of students for selected grade levels are also described below.

The instructor reserves the right to adjust a student’s final grade by one-half step (e.g., a student that received a score that would give them a C, it could be raised to a C+ or lowered to a C–), so long as the final grade is not lowered below a D or raised higher than an F.

Letter Quantitative Qualitative
A >93.0 Superior mastery
A– 90.0–92.99
B+ 87.0–89.99
B 83.0–86.99 Good mastery
B– 80.0–82.99
C+ 77.0–79.99
C 73.0–76.99 Satisfactory achievement
C– 70.0–72.99
D+ 67.0–69.99
D 60.0–66.99 Less-than-satisfactory achievement
F <60.0 Unsatisfactory achievement
Superior mastery
  • not only attends every class, but actively engages with both the instructor and other students
  • cares about their own learning as well as the learning of their fellow students
  • asks questions
  • seeks knowledge outside of the classroom
  • demonstrates comprehension of course material
  • communicates clearly and understandably
  • challenges the knowledge of the instructor and fellow students
  • does more than the minimum requirements outlined in class
Satisfactory achievement
  • attends the majority of classes
  • participates in classroom discussions
  • completes most to all weekly tasks
  • occasionally asks questions to clarify concepts
  • completes all or most coursework demonstrating basic competency of the material
Less-than-satisfactory achievement
  • minimal contact with the instructor
  • pays some attention during class
  • does not complete all the readings or assignments
  • participates little to not at all in classroom discussions
  • demonstrates minimal effort with assignments
  • term project is incomplete

Course Calendar

In this classic fifteen week semester, each week will consist of at least one discussion topic with associated readings and responses and up to one “sandbox challenge” assignment. These ungraded assignments are meant to stimulate your creativity and problem-solving skills that will help you with addressing the graded assignments and final project.

The schedule for discussion meetings, lectures, and assignments is posted online here. The dates in the calendar are tentative; all changes to the schedule will be announced in class and through Blackboard’s announcement service and Slack. Students are responsible for ensuring an awareness of any such announcements. In other words: you must read your campus email and check Blackboard and Slack regularly!

Important Dates

First day of term
  • 1 September 2021
Add/drop deadline
  • 10 September 2021
Fall break
  • 16–19 October 2021
Withdraw deadline
  • 1 November 2021
Last day of term
  • 10 December 2021
Day of final exam
  • 21 December 2021

Standards & Policies

The Code of Conduct

By the basis in which this class is designed, these things hold true:

  1. This class utilizes several forms of communication (e.g., instant messaging, discussion boards, wikis, emails, audio/video conferencing), of which not all can be monitored by the instructor at all times.
  2. You are responsible for how you interact with each other.

Remember these things as you work together:

  1. “Don’t ascribe maliciousness to that which can be explained by inadvertence.”

    This comes from the fact that it is almost impossible to portray our feelings or intended meaning behind typed text. If something offends you, take a breath, be cordial and ask for clarification before unleashing your wrath (BTW: you shouldn’t unleash your wrath). That being said, also do not be a silent witness. If something offends you, let everyone know. We will never learn from our mistakes if our mistakes are never pointed out. If malicious actions continue, the instructor will manage it.

  2. “There is no innovation and creativity without failure. Period.”

    You are a college student registered in a college class. You are not expected to know everything. The entire purpose of this exercise is for you to gain knowledge, so make an effort. If you want to try something, try it and let everyone know what you are up to. Best case scenario, your innovations spark new insight. Worst case scenario, we all learn something from your efforts. It’s better to aim high for something that will make a difference rather than to play it safe with something easy.

  3. Ask lots of questions.

    Questions are cheap, so ask a lot of them. When asking questions, remember to always strive for clarity. If you don’t know something or your aren’t sure, just ask. Sometimes, knowing the right question to ask is just as difficult as finding the answer. When you find yourself here, please send up a flare or simply say “I’m lost.” We will get you back on track.

  4. Focus on opportunities.

    Remember: this is not a race and you are not a judge, so don’t get caught up with critiquing or competing with each other. Provide your opinions and perspectives and then actually take the time to read the opinions and perspectives of others. Challenge yourself to see things differently and try things differently. Ignore your desire to be correct or to correct someone else and try not to contradict people; they don’t like it and biologically it shuts down their ability to see things logically.

  5. Document and share everything.

    Reproducible science should not be an afterthought and neither should the reproducibility of your coursework. While it may feel natural to keep your work private, in the real world projects really thrive when you document your process publicly. Have you ever searched for answers on the internet? Did it help? By writing things down and sharing them, more people can participate along the way and you might get help on something you didn’t even know you needed. This leads to more things being documented, which produces a better project roadmap, which leads to better transparency and feedback, which leads to good decision-making and faster/better results.

  6. Everyone is bound to uphold a policy of respect for their instructor and their peers. Students should be open-minded to new ideas and participate in collegiate debate, the sharing of ideas, and the receiving of feedback without defamatory remarks. Students should help maintain a healthy learning environment by refraining from negative behavior, such as harmful remarks, quibbling over trivial matters, creating unnecessary debates, or bullying.

    There is zero tolerance for negative behavior. Failure to uphold this policy will result in punitive action and/or removing the offending student from access to all or part of the class.

Engagement Policy

  • Weekly engagement is expected.

In this class, there is a scheduled time and place for weekly meetings. Engagement is therefore defined loosely as your interaction with the instructor either by coming to class or using one or more of the communication tools listed above (see Communication).

It is expected that you interact weekly through class attendance, posted discussions, chats, issues, emails, and/or video conferencing. Failure to do so may influence your final grade (see Failure Criteria).

Please make any and all planned absences (e.g., field trips, vacations, athletic events) that would result in you not being able to meet this minimum interaction known to your instructor at the earliest possible time.

Coursework Policy

  • You are bound by the honor code.

By accepting admission to the College, you have made a commitment to understand, support, and abide by the Honor Code. Violations, whether attempted or successful, will result in consequences ranging from a grade of zero for the assignment up to a failing grade for the class.

Misconduct may include, but is not limited to, the following:

  • cheating or using unauthorized materials on assignments
  • fabrication, forgery, alteration, or destruction of documents; hacking unauthorized resources; intimidating or bribing peers; improper collaboration or colluding; plagiarizing; or lying in order to obtain academic advantage
  • assisting others in misconduct
  • attempting misconduct

Instant Messaging Policy

  • The instant messaging app should be used for communication that either: (1) needs to reach people quickly or (2) needs a quick response.

You are free (and encouraged) to create your own chat groups with classmates for better/faster communication. Please do not use the instant messenger for spamming, soliciting or otherwise disrupting the peace. The instructor is free to mute any and all messages during “unsociable” hours and therefore, may not always respond instantly; you are free to do the same.

Email Policy

  • All personal correspondence should be made to the instructor’s W&M email address (see above) and include the class title in the subject line.

For specific questions, raise an issue on GitHub. For private inquiries, please email the instructor; the instructor will confirm each email received. If you do not receive a confirmation message from the instructor within 12 hours of sending, you may send a follow-up email.

Rework Policy

  • The final due date for all work is the final hour of the final exam period.

Weekly quizzes and the final exam become unavailable following the final hour of the scheduled final exam period.

Weekly discussion responses should not exceed 10 working days following their initial posting. Original submissions to discussion posts (e.g., not a follow-up to a question or a clarification) occurring after the 10 working day limit that did not receive prior permission for late submission may not be counted as complete.

Students are expected to create two web pages for the course website: (1) about the coder and (2) spatial data discovery. Errors in formatting, omissions of content, or other issues may be flagged by the instructor for correction. Please respond within three (3) business days with the requested corrections or a request for an extension (providing reasoning).

Electronics Policy

  • Personal electronics use that is not directly related to the learning objectives of the class is not permitted.

Students may keep cellular phones on, but they should be kept in silent mode (please note that vibrate is not silent). In the case of an emergency, students are requested to quietly excuse themselves from the classroom to make and/or receive telephone calls. Please abstain from any disruptive behavior, the definition of which is at the discretion of the instructor and may include but is not limited to playing video games, listening to music, trolling the internet, or any other activity whereby the sight or sound interrupts the attention of those around you.

Multiple offenders may be asked to leave the classroom and will be considered equivalent to an unexcused absence.

Digital Recording Policy

  • Identifying photos or videos of people are not permitted without just cause.

Due to privacy laws/concerns, the recording of people and/or their voices during class meetings in any form is prohibited without unanimous approval except in documented cases from the Academic Support office that require compliance with the ADA.

Computer Policy

  • You are required to have (access to) a laptop computer that runs a modern operating system.

Standards for Written Work

  • Written work should be in English and follow, to the best of your abilities, the rules of the English language (see Strunk & White).
  • Written work should be neat, thorough, legible, and logically organized.
  • All submitted work should include your name, date, and description.
  • Filenames should be in all lowercase and contain only alphanumeric, underscore, hyphen, and/or period characters.
  • Sketches and figures should be drawn using a straight-edge; plots and graphs should be done using a computer, where appropriate.
  • Tables, figures and images from online sources should include a citation (including the author/publisher, date created/accessed, and URL).
  • All in-text citations and references should be formatted to APA standards unless otherwise indicated.
  • Plagiarism will be taken seriously; if you write something that is not your own original idea or in your own words, then it must be cited! See here for information on plagiarism and how to avoid it.
  • Unless otherwise stated, typed electronic files may be in one of the following acceptable open formats:

Statements and Resources

ADA Statement

William & Mary accommodates students with disabilities in accordance with federal laws and university policy. Any student who feels they may need an accommodation based on the impact of a learning, psychiatric, physical, or chronic health diagnosis should contact Student Accessibility Services staff at 757-221-2512 or at to determine if accommodations are warranted and to obtain an official letter of accommodation. For more information, please see https://www.wm.edu/sas.

If you need accommodations, you have a right to have these met, so it is best to notify the instructor as soon as possible.