page-fetcher-api: centralised Playwright page fetcher with DB persistence #43

Closed
opened 2026-06-08 14:31:25 +00:00 by claude · 0 comments
Member

Create a new internal page-fetcher-api service that centralises all web-page fetching via Playwright (headless Chromium).

New projects

  • page-fetcher-api-models: FetchPageRequest, FetchPageResponse, IPageFetcherApiClient, PageFetcherApiSettings
  • page-fetcher-data: PageFetchDbContext, PageFetchEntity, EF migrations (schema: pageFetcher)
  • page-fetcher-api: PlaywrightBrowserService (singleton browser), PageFetcherService, PageController POST /api/page/fetch

Changes to existing services

  • cv-matcher-api: replace HttpClient in JobTextExtractor with IPageFetcherApiClient
  • cv-search-job: replace inline Playwright in HtmlJobSearcher; fetch individual job pages in CvSearchJobTask + keyword pre-filter
  • common: add PageFetcherApiSettings
  • docker-compose.yml, .env files, build.yml: add new service

DB

All fetches saved to pageFetcher.PageFetches (URL, HTML, text, duration, caller, status code).

Create a new internal page-fetcher-api service that centralises all web-page fetching via Playwright (headless Chromium). ## New projects - page-fetcher-api-models: FetchPageRequest, FetchPageResponse, IPageFetcherApiClient, PageFetcherApiSettings - page-fetcher-data: PageFetchDbContext, PageFetchEntity, EF migrations (schema: pageFetcher) - page-fetcher-api: PlaywrightBrowserService (singleton browser), PageFetcherService, PageController POST /api/page/fetch ## Changes to existing services - cv-matcher-api: replace HttpClient in JobTextExtractor with IPageFetcherApiClient - cv-search-job: replace inline Playwright in HtmlJobSearcher; fetch individual job pages in CvSearchJobTask + keyword pre-filter - common: add PageFetcherApiSettings - docker-compose.yml, .env files, build.yml: add new service ## DB All fetches saved to pageFetcher.PageFetches (URL, HTML, text, duration, caller, status code).
Sign in to join this conversation.
No Label
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: AI/myAi#43