| Age | Commit message (Collapse) | Author | Lines |
|
navey floor (赤坂): 29件取得、Cheerio + WordPress カスタムCMS
下北沢ERA: 22件取得、HTTP only (TLS証明書無効のため)
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
|
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
|
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
|
Rewrote selectors to match actual HTML structure after inspecting each site:
- LIQUID ROOM: article selector, date extracted from URL (_YYYYMMDD suffix)
- WWW/WWW X: article.column selector, month from li.month nav, day rollover detection
- 新宿 LOFT: section.block_schedule_list links, full date from time div.year/month/day
- CLUB QUATTRO: li[data-event-date] with ISO date attribute, jp-label time parsing
Result: 0件 → LIQUID ROOM 25件 / WWW 48件 / LOFT 13件 / QUATTRO 24件
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
|
STEP, mod
New scrapers: Fever 下北沢, Nine Spices 下北沢, 西荻窪 JAM, mod 柴崎, 中野 MOON STEP
Artist extraction added/fixed:
- warp-kichijoji: parse div.w-flyer (clone + remove nested notes-wrapper)
- shibuya-o: rewrite to scrape each sub-venue; artist from li.p-scheduled-card__artist-item
- moon-step-nakano: parse 出演 section from WordPress API description HTML
- mod-shibasaki: fetch individual event pages in parallel; handle live:/出演:/・ bullet formats
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
|
- Install Playwright + Chromium; add shared browser singleton (playwright.server.ts)
- Rewrite flat-nishiogikubo scraper: Wix calendar via headless browser,
month navigation via date picker, extracts .WPczEB/.ExCBIq selectors
- Rewrite pitbar-nishiogikubo scraper: freecalend.com via headless browser,
parses cal-{member}-{year}-{month}-{day} cell IDs
- scraper-runner: close shared browser after each run with closeBrowser()
- Remove all scrape trigger buttons from web UI (events index, venues page);
remove /api/scrape and /api/scrape-status from routes.ts
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
|
Pitbar 西荻窪
meets-otsuka: rinky.info プラットフォーム。div.blog-entry.event-wrap[event-date]
から日付・タイトル・出演者・時間・価格・チケットURLを取得。
warp-kichijoji: WordPress カスタムテーマ。<h3>YYYY<br/><span>MM</span></h3> で
年月を取得、article.schedules-box から各イベントをパース。
flat-nishiogikubo: Wix サイトのため JS レンダリング必須。エラーを返す
プレースホルダー実装(Playwright 等への移行が必要)。
pitbar-nishiogikubo: freecalend.com (mem25771) から取得を試みるが、
ボット遮断のため現状はエラー。URL パターン・代替策をコメントに記載。
SCRAPE_TARGETS.md に状態列(✅/⚠️)を追加。
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
|
Full-stack React Router v7 app that scrapes event listings from major
Tokyo live venues (Liquid Room, WWW/WWW X, Shibuya O-EAST, Shinjuku LOFT,
Club Quattro) and stores them in SQLite for browsing and search.
- Modular scraper architecture: add a new venue by dropping a file in
app/scrapers/ and registering it in index.ts
- Routes: /events (filter by keyword/venue/date), /events/:id, /venues,
GET /api/scrape
- EventCard shows artist, date/time, venue, ticket URL, and fee
- Post-scrape per-venue Markdown files generated to events/ (dev reference)
- /add-livehouse Claude Code skill defined in .claude/commands/
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|