Input mode
Directory page URL
Paste the exact URL of a faculty/staff directory page. The scraper will fetch it directly — no discovery, no pagination.
Directory page URLs
One URL per line. Each URL is scraped directly as a faculty/staff directory page.
CSV File
Required columns: institution, department, directory_url
Drag & drop a CSV or click to browse ✓ entries loaded — click to replace
Preview (first 5 rows)
University homepage URLs
One URL per line. The scraper will auto-discover departments and directory pages.
Filter by role
Saved custom roles (persisted)
Options
VPN rotation (PIA)
Rotates IP every 15 requests
SOCKS5 proxy
Route requests through configured proxy
Strict URL mode
Only scrape the given URL — no tab or department discovery
Status
Contacts found
URLs processed
Run ID
Live log
Waiting for output…
No contacts yet. Start a run or load one from History.
| Institution | Department | City | State | First Name | Middle | Last Name | Title | Score | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| LI ↗ — |
No contacts yet. Run a scrape or click Refresh.
| Institution | Department | City | State | First Name | Middle | Last Name | Title | Score | ACM | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LI ↗ — | ✓ — | ||||||||||||
|
|
|||||||||||||
No duplicates found
All contacts have unique email addresses and names.
Why do I see duplicates that only appear once in the main table?
The main view automatically hides duplicates by showing only one contact per email address. These are real duplicate rows in the database that still take up space and can cause issues (e.g. double-syncing to ACM). Use the buttons below to clean them up permanently.
The ★ kept row is the one that will survive — auto-selected as the richest record, but you can hover any row and click Keep to override it (useful when the auto-selected record has garbage data like "Main Menu"). Then click "Merge group" to delete the rest.
| Name | Title | Institution | Run | |||
|---|---|---|---|---|---|---|
| ★ | ★ kept |
Page details
The scraper will try to auto-detect the URL from the pasted HTML, but providing it here is more reliable.
When enabled, the scraper visits each person's profile sub-page to collect email addresses. Slower but necessary for sites where emails are only on individual profile pages.
Department is auto-detected from the pasted HTML (title tag, headings). You can paste multiple HTML sections to accumulate contacts in the same session.
Paste HTML
Session contacts
No contacts yet.
Paste HTML and click Extract.
No runs yet. Start a scrape from the New Run tab.
New Schedule
No schedules yet. Create one to run scrapes automatically.
Institution Coverage
All institutions and departments scraped via Batches.
| Institution | Department | Contacts | URLs scraped | Last Scraped | Batches |
|---|---|---|---|---|---|
| Loading… | |||||
| No batch scrapes found. Run batches to build coverage. | |||||
| (pre-batch) | |||||
ACM Subscriber Sync
Push eligible contacts to your ACM mailing list.
Synced — pushed to ACM · Pending — eligible, not yet synced · Incomplete — missing name/email/state · Skipped — filtered out by role/field
Run full sync
Pushes all pending contacts to ACM. Already-synced contacts are skipped.
Sync complete
Errors:
Recently synced
Smart Merge
Review and merge duplicate contacts grouped by email address.
No duplicate email groups found
All contacts have unique email addresses.
Select a group to review
source records ·
Why this needs review:
Source Records (click a cell to use that value)
| Run | First Name | Last Name | Title | Institution | Department | City | State |
|---|---|---|---|---|---|---|---|
Green = best & high-quality · Amber = best but suspect · Red = score 0 · Grey = not selected
Proposed Merged Record (edit to override)
ACM synced — acm_synced_at and acm_uid will be preserved on the merged record.
Role Library
Roles saved here appear as filter options in All Results. Default roles (grey) are built-in and cannot be removed.
Custom roles
No custom roles yet. Add one above.
Default roles
Domain Lookup
Maps email domains to institution, city, state, and cluster. Used to enrich contacts missing location data.
New Entry
| Domain | Institution | City | St | Cluster | Actions | ||||
|---|---|---|---|---|---|---|---|---|---|
| Loading… | |||||||||
| No entries found. | |||||||||
|
|
|||||||||
Promo & Referral Codes
| Code | Label | Discount | Type | UTM Source | Uses | Expires | Status | Booking Link | Actions |
|---|---|---|---|---|---|---|---|---|---|
| / / ∞ | Active Disabled |
Share the booking link (e.g. https://yourdomain.com/book?code=REF-PRINCETON-2026) in ACM emails or referral messages.
The link validates the code and redirects to SM8 with the UTM source appended automatically.