697 - Data Sourcing for Sharing Excess by marcbachan · Pull Request #720 · phlask/phlask-map

marcbachan · 2026-03-10T19:01:16Z

Pull Request

Change Summary

FYI: Claude helped quite a bit here in building out a basic CLI component and adding Supabase helper functions. Extra scrutiny on those is welcome.

Addresses #697. Introduces a standalone Python script that is designed to pull down events from a public Google Calendar, such as Sharing Excess, and normalize the retrieved events to be able to store them in the resources table in Supabase.

As the resources table does not have start/end date fields, these are pulled from the site and inserted into the description with some clear delimiters, like:

[[ start: 2026-03-10T15:00:00-04:00 | end: 2026-03-10T17:00:00-04:00 ]]

This allows us to do some post-processing/filtering to determine whether the event is "live" or not.

We can do this scrape periodically by using the LOOK_FORWARD_DAYS property to get all events for a specific window into the future, or just do this monthly in one of the PHLASK sessions or something. Not sure how we want to handle.

Change Reason

Billy summed this up quite nicely on #697. Essentially, we would like to be able to actively maintain "live" food sites posted by Sharing Excess and help them and us get the word out a little easier.

Verification [Optional]

Here is an example of a CSV debug output that we can get by using the basic CLI component that Claude helped write:

 python calendar_to_supabase.py --csv

events.csv

These records can then be written to the DB either directly with CSV import in Supabase, or enter the credentials in the .env file here and run the script with the helper to write them to the resources table.

Related Issue: #697

ical and normalize records for the resource table

marcbachan · 2026-03-24T22:02:51Z

@icycoldveins Got Claude's help with the edge function (supabase/functions/sync-sharing-excess/index.ts) for this in a new directory for Supabase sync operations. Let me know if this lines up with what you had tried out, and let me know if you have any suggestions.

@gcardonag @vontell curious to get your thoughts on this regarding automating and maintaining the sync script by using edge functions. Otherwise we can just use the Python script with Lambda in AWS like the other one.

Match existing records by name + source URL, update those (preserving date_created), insert only new ones. Stale entries that are no longer in the scraped data get cleaned up. Aligns with the approach in PR #720.

…r column

…cript and insert

… of https://github.com/phlask/phlask-map into 697-data-sourcing-for-sharing-excess-food-distribution

…rors with the JSONB object dates, debugging logs for the records to delete to check if the API key is read-only

RRodriguez26 · 2026-05-26T23:40:27Z

+        supabase = get_supabase_client()
+        delete_by_creator(supabase)
+        insert_resources(supabase, resources)


Having a discussion with Ron and Añil, what is the use of the delete function? I also heard that we want to delete anything overall, is there a reason why might we use the delete functionality in the database rather than just update?

@RRodriguez26 Updating is definitely better, and I can tweak this a bit further to do it properly. I settled on delete so that we could update our database with current data, which was becoming a problem. Now that it's updated, I can revisit this and implement it more intelligently.

One of the key issues with doing the update route was that recurring events, despite having many distinct occurrences, all collide under one gp_id, so the script needs to handle this on repeated syncs and update the timestamps for whatever occurrence of that event is up next instead of processing every occurrence as a unique resource.

But overall, yes, you all are right. Deletion creates an issue with churned resource IDs, especially if there are crowdsourced edits using that ID as a foreign key. In the long term that's not sustainable, so I'll work on resolving that update issue for the recurring events.

RRodriguez26 · 2026-05-26T23:44:35Z

I also heard that these data scripts should be in its own repo, we see that there is a repo dedicated to it but we are not sure if this is the right one.

marcbachan · 2026-05-27T13:02:50Z

I also heard that these data scripts should be in its own repo, we see that there is a repo dedicated to it but we are not sure if this is the right one.

Yep, I'm going to put up a joint PR for this script and the other one on that repo. I don't think anyone can recall if there was another reason for it, so it's a good fit.

marcbachan added 3 commits February 24, 2026 19:48

add experimental fetch from sharing excess gcal data

3247976

update the script to get recurring events with

c352872

ical and normalize records for the resource table

no need for new table

2d9fa8a

marcbachan requested a review from vontell March 10, 2026 19:01

marcbachan self-assigned this Mar 10, 2026

marcbachan added Data Circle Tickets related to the Data Circle Civic Circle labels Mar 10, 2026

marcbachan linked an issue Mar 10, 2026 that may be closed by this pull request

Data sourcing for Sharing Excess food distribution #697

Open

marcbachan requested a review from RaulBSanchez March 10, 2026 19:08

marcbachan added 2 commits March 24, 2026 17:53

cleaner address parse to get missing entries

3332e64

claude-assisted attempt at an edge function for this

e65ca77

marcbachan and others added 7 commits April 7, 2026 20:28

update geocoding parse and processing

b39759e

fix: change filter implementation to look for creator value in creato…

83ec7f5

…r column

get the db creds from env, run a delete for prior entries from this s…

553ff0a

…cript and insert

require dotenv

e65a057

less verbose readme, stick to the point

894fbf0

Merge branch '697-data-sourcing-for-sharing-excess-food-distribution'…

5cf8ed9

… of https://github.com/phlask/phlask-map into 697-data-sourcing-for-sharing-excess-food-distribution

better address parsing, add a no hours option to avoid the parsing er…

0d72679

…rors with the JSONB object dates, debugging logs for the records to delete to check if the API key is read-only

RRodriguez26 reviewed May 26, 2026

View reviewed changes

use upserts instead of delete and insert

27c5e5f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

697 - Data Sourcing for Sharing Excess#720

697 - Data Sourcing for Sharing Excess#720
marcbachan wants to merge 13 commits into
developfrom
697-data-sourcing-for-sharing-excess-food-distribution

marcbachan commented Mar 10, 2026 •

edited

Loading

Uh oh!

marcbachan commented Mar 24, 2026

Uh oh!

RRodriguez26 May 26, 2026 •

edited

Loading

Uh oh!

marcbachan May 27, 2026 •

edited

Loading

Uh oh!

RRodriguez26 commented May 26, 2026

Uh oh!

marcbachan commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

marcbachan commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request

Change Summary

Change Reason

Verification [Optional]

Uh oh!

marcbachan commented Mar 24, 2026

Uh oh!

RRodriguez26 May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

marcbachan May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RRodriguez26 commented May 26, 2026

Uh oh!

marcbachan commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

marcbachan commented Mar 10, 2026 •

edited

Loading

RRodriguez26 May 26, 2026 •

edited

Loading

marcbachan May 27, 2026 •

edited

Loading