Skip to content

Record Selector spec

Design goals

  • When the user wants to enter a value into a FK cell, they use the record selector to select a record from the linked table.
  • If the linked table does not already contain a matching record, then the record selector makes it easy for the the user to create a new record and select it.

Basic behavior

Here’s a “Publications” record selector, as it looks immediately after opening:

image

Layout

  • It’s a modal.
  • Its height grows as needed, but no more than 10 results ever display, making vertical scrolling unlikely.
  • If vertical scrolling becomes necessary, then only the results scroll (with the headers and search inputs remaining fixed).
  • To account for tables with many columns, the width of the modal should grow as needed until it fills almost the entire viewport width. Even so, horizontal scrolling may be needed in some cases.

Opening

  • When a FK cell is selected, pressing Enter opens the record selector for it. Note: This behavior is consistent with Google Tables, but different from AirTable. For a cell with existing values, AirTable does not open the record selector on Enter, presumably to prevent the user from accidentally overwriting values. I prefer the consistency of Google’s behavior here.
  • Double-clicking a FK cell opens the record selector for it. (This UX is also more in-line with Google than AirTable.) Note: We could consider adding additional entry points such as an icon click target within blank cell (akin to AirTable), but I think it’s worth exploring those details later.
  • The record selector functions identically regardless of whether its origin cell already contains a selected record. This behavior matches Google Tables, but differs from AirTable because AirTable requires the user to manually clear the cell before opening the record selector.

Search inputs

  • After opening, the input within the first non-PK column has focus. (And this behavior may actually be a bit more smarter once we have implemented some other features for which I have not yet released specs documents.)
  • Tab and Shift+Tab moves focus between the search inputs.

The “selected option”

  • The record selector presents records as “options”, similar to a <select> element.
  • One (and only one) option is selected at all times.
  • The selected option is styled the same as when a row is selected in a sheet.
  • The first row is selected by default.
  • Up and Down keys move the selection.
  • If the user modifies the selection and then modifies the query, the selection is reset to the the first row.

Closing

  • Clicking the close button on the modal closes the record selector without producing a record.
  • Pressing the Esc key closes the record selector without producing a record.
  • Single-clicking on a result closes the record selector, producing the record associated with that result.
  • Pressing Enter closes the record selector, producing the record associated with the currently select option.
  • When a selection is made, the value of the cell is updated to link to the selected record. When no selection is made, the value of the cell is not changed.

Search algorithm

The search uses some fuzzy logic when finding and sorting records. Here’s how it works:

Overview

  • The user enters values into the search inputs exactly as they would enter them into cells within the sheet. For example, a Date type will have a date picker. There will likely be some edge cases here to work out during implementation, especially when dealing with invalid input.
  • Records are matched against all query columns using a points system.
  • For non-string-like columns (e.g. Number, Date, etc), a record get 4 points per column matched. A match is performed via strict equality.
  • For string-like columns, a record gets 4 points for exact matches, 3 points for matches at the start of the string, and 2 points for matches within the string.
  • Results are displayed with records listed first that have the highest total points across all query columns.
  • Without any search criteria, ten records are displayed, ordered by descending primary key (Note that the mockups herein do not accurately reflect this requirement.)

Example

  1. Begin with our shared sample data (transformed to the final schema).
  2. Add some more data:

    insert into "Authors" ("First Name", "Last Name") values
    ('Brianna', 'Murphy'),
    ('Annabelle', 'Smith'),
    ('Joanna', 'Stevenson'),
    ('Adrian', 'Richardson'),
    ('Anna', 'Richardson'),
    ('Anna', 'Wilson-Rich'),
    ('Joanna', 'Rich'),
    ('Hans', 'Ulrich');
    
  3. We are looking for an author named “Anna Rich”. Let’s search the “Authors” table by entering “Anna” into “First Name” and “Rich” into “Last Name”.

  4. This produces the following results (with the points column displayed here for reference).

    Id First Name Last Name Website points
    Anna Rich
    276 Anna Richardson 7
    277 Anna Wilson-Rich 6
    278 Joanna Rich 6
    137 Barbara Rich 4
    80 Anna Humphrey http://annahumphrey.com 4
    6 Anne Richards http://richards.biz 3
    39 Michael Richards 3
    71 James Richard https://richard.com 3
    272 Annabelle Smith 3
    274 Adrian Richardson 3
  5. We don’t get any exact matches, but the records that match most closely are listed first.

  6. The query is run as follows:

    WITH anon_cte AS (
        SELECT *,
            CASE WHEN "First Name" ILIKE 'Anna' THEN 4
                WHEN "First Name" ILIKE 'Anna%' THEN 3
                WHEN "First Name" ILIKE '%Anna%' THEN 2
                ELSE 0
            END
            +
            CASE WHEN "Last Name" ILIKE 'Rich' THEN 4
                WHEN "Last Name" ILIKE 'Rich%' THEN 3
                WHEN "Last Name" ILIKE '%Rich%' THEN 2
                ELSE 0
            END
            AS points
        FROM "Authors"
    )
    SELECT * FROM anon_cte
    WHERE points > 0
    ORDER BY points DESC
    LIMIT 10;
    

    The case statements should probably be wrapped into some db-layer function like search_score(text, text) -> int, and overloading that would enable scoring other types.

Creating a new record

  1. When at least one column contains a query, a “ghost row” will appear above the result set, allowing the user to select a record that will be created on-the-fly using all of the data from their search. The ghost row is filled in as the user types.

    image

  2. The user can highlight the ghost row by pressing Up – and in this state, validation errors will be displayed for cells as needed using the same UX as when adding a new row to a table.

    image

  1. To create a new Publications record, the user needs to supply an Authors record. They use a “nested” record selector to locate or create one Authors record as follows:

    image

    Note: There are some layout details here that will need to be worked out during implementation, such as horizontal scrolling.

  2. The nested record selector opens when its target cell is selected (there’s not need to press Enter), and it closes when a different higher-level cell is selected. There’s no way for it to be closed while its target cell is selected.

    Note: UX gets a bit weird with tables that only contain FK columns. In the worst case scenario, a table would have only two columns, both of which are FKs. Selecting a record would be impossible because there would be no way to close the nested selectors. In the slightly-better-but-still-bad case, a table contains a PK plus two FKs. Here, you’d need to select the PK cell to hid the nested selectors. These cases are rare enough that I think it’s fine to discount them for now. With some more thought and experimentation, we can detect cases like that adjust the UI behavior a bit to better suit them.

  3. After selecting an Authors record, the top-level Publications record selector will become active again, with a value for “Author Id” filled in, and with a string representation of that Authors record displayed in its place.

Unsolved problems

Updating a record before selecting it

Use case…

  1. I’m entering a new “Items” record.
  2. For my item, I’m selecting a “Publications” record.
  3. I’ve found the publication using the ISBN, so I’m confident that it’s the correct record. I’d like to select it!
  4. But I also notice that the record is missing a “Year” value.
  5. I can see the year on the book right in front of me so I’d like to update the data, making it cleaner. But I don’t want to lose my place in the process of adding a new “Items” record. I’d like to update the publication year inline before choosing it.

With a little more UX work, I think it would be possible to incorporate an “update” functionality into the record selector.

Locating a record based on aggregate data

Use case…

  1. I’ve acquired a new copy of Moneyball by Michael Lewis, and I’d like to enter it as an item.

  2. Within the item, I begin to locate or create a “Publications” record. My search looks as follows:

    Id ISBN Title Year Author Id Publisher Id
    Moneyball
    91821 0393324818 Moneyball: The Art of Winning an Unfair Game 2004 Michael Lewis W. W. Norton Company
  3. The first result appears to be a match at first, but then I notice that the book I’m holding is the 2011 edition, with ISBN 0393338398. Upon entering those details I see there is no matching record in the database, so I set out to add one by filling in all the fields I can.

  4. I need to associate my new Publications record, with an Authors record. Here’s what my Authors search looks like:

    Id First Name Last Name Website
    Michael Lewis
    8276 Michael Lewis NULL
    13901 Michael Lewis NULL
  5. Hmmm. There are two Michael Lewis records! Which do I chose? Are they duplicates? I real life, there are actually as many as twenty published authors named Michael Lewis! While picking an Authors record, I want to see a list of books associated with each author.

    This is tricky though. The Michael Lewis I’m after has published a lot of books! How will that list of books be sorted? What text will display for each book? (Because the full title of Moneyball is actually quite long.) Can I filter the list? Can I select an author named Michael Lewis who has published at least one book whose title begins with Moneyball??? Designing a UX which helps answer that sort of question (without lots of clicks and keystrokes) is an interesting challenge!

We could also construct a more common use case if we venture outside our example schema. Consider a CRM-like schema with person records which each have many email_address records. When entering an activity record, I want to select a person that has at least one email_address which contains a field that matches a query. And in this case I also want to add a new person with that email address if I don’t find a matching person. Adding is especially tough because we can’t create the email_address record until we create the peron record, so the “drill-down-and-back-up-again” UX outlined in the specs above won’t work for the email_address.

These scenarios are more tricky to handle and will require more UX thought. But for some schemata handling them well will be pretty important.