Fetchcraft LabsContact
Back to blogs
Strategy

Phone Specs Database Automation: Build catalog, comparison, and device intelligence workflows

Build phone spec workflows for catalogs, comparison filters, app compatibility research, and market analysis using structured device data.

Jun 202611 min readBy FetchCraft Labs
phone specs databasesmartphone specifications datasetphone comparison datadevice catalog automationmobile device dataApify actor

Last reviewed: June 1, 2026.

Phone specification data becomes useful when it stops being a page you read and becomes a table you can join, filter, and test against. A phone specs database can support comparison pages, device filters, market research, app compatibility analysis, product catalog enrichment, and internal reference tools.

The practical challenge is not finding one phone's specs. The challenge is building a repeatable workflow that can collect many device records, keep field names stable, preserve model context, and make the data usable outside the source interface.

This playbook shows how to use Phones Specs DB for that kind of workflow. It is based on the live Phones Specs DB Apify actor, which searches by brand or release year and returns structured device records with fields such as model identity, release dates, physical dimensions, display, CPU, RAM, cameras, battery, connectivity, operating system, market countries, and price when available.

Quick answer

Use a phone specs database when your workflow needs repeatable device-level data, not one-off lookups.

Good fits:

  • build a searchable phone catalog
  • enrich ecommerce or affiliate product pages
  • create phone comparison filters
  • analyze releases by year, brand, chipset, battery, display, or market
  • support app compatibility and QA research with device attributes
  • maintain an internal reference table for mobile devices

Less suitable:

  • live inventory or availability tracking
  • official manufacturer-only compliance data
  • pricing feeds that must update hourly
  • workflows that need only one or two model lookups

How the actor works

Phones Specs DB accepts one of two search modes:

Search modeInputBest use
Brand searchsearchType: "brand" plus brandBuild a manufacturer-level catalog, such as Apple, Samsung, Nokia, Xiaomi, or Motorola
Release-year searchsearchType: "releaseYear" plus releaseYearBuild a time-based dataset, such as phones released in 2023 or 2024

The actor page states that release year must be 1991 or later. It also says the actor automatically fetches all available results for the chosen search criteria, so the workflow should be designed around batches rather than single records.

Example brand input:

{
  "searchType": "brand",
  "brand": "Apple"
}

Example release-year input:

{
  "searchType": "releaseYear",
  "releaseYear": "2023"
}

The output is a dataset of device records. A typical item can include:

  • ID, brand, model, codename, release date, announced date
  • width, height, depth, mass, build details
  • platform and operating system
  • CPU, chipset, GPU, RAM type, RAM capacity, storage context
  • display diagonal, resolution, pixel density, display type, refresh rate
  • rear and front camera details
  • battery type, nominal battery capacity, charging power
  • Bluetooth, Wi-Fi, USB, SIM, sensors, durability ratings
  • supported markets and price when available

That breadth is what makes the actor more useful as a pipeline input than as a simple scraper result.

Workflow pattern: raw data to usable device table

Do not send scraped device records straight into a production catalog. Keep a small pipeline between extraction and publishing.

Recommended flow:

  1. Run the actor by brand or release year.
  2. Store the raw dataset exactly as returned.
  3. Normalize field names into your internal schema.
  4. Create derived fields for filtering and search.
  5. Run validation checks.
  6. Publish to the destination: CMS, database, spreadsheet, search index, BI tool, or internal API.

The raw dataset is your audit trail. The normalized table is what your product, SEO, or analysis system should use.

Reference schema for downstream systems

A good downstream schema separates source fields from business fields. You do not need to keep every source value in your main table, but you should keep enough to support your use case.

Internal fieldSource examplesWhy it matters
device_keysource ID plus normalized modelStable dedupe key
brandbrandBrand pages, filters, joins
model_namemodelSearch, title generation, comparison UI
release_yearreleased or announced dateTrend analysis and year filters
device_categorydevice categorySeparate phones from related device classes
platformplatform, operating systemAndroid/iOS grouping and compatibility analysis
chipsetCPU/chipset stringPerformance grouping and technical filters
ram_mbRAM capacityRange filters and comparison tables
display_inchesdisplay diagonalBuyer-facing filters
resolutionresolutionComparison and display-density logic
battery_mahnominal battery capacityBattery filter and ranking logic
rear_camera_mpcamera fieldsCamera comparison logic
market_countriesmarket countriesRegional availability context

Keep the original source fields in a JSON column or a raw table. This lets you add new derived fields later without rerunning every historical extraction.

Playbook 1: Build a phone comparison database

Comparison sites need consistent attributes. If one model has display_diagonal, another has screen_size, and a third has only a paragraph of text, filters become unreliable.

Workflow:

  1. Run by brand for your priority manufacturers.
  2. Normalize units for display size, weight, battery, RAM, storage, and charging power.
  3. Create boolean or range fields such as has_5g, battery_over_5000_mah, display_120hz, or usb_c.
  4. Build comparison pages from normalized fields, not raw scraped text.
  5. Keep a manual review queue for high-traffic models.

Example derived fields:

{
  "brand": "Xiaomi",
  "model_name": "Mi 13 5G Premium Edition",
  "release_year": 2023,
  "battery_mah": 4500,
  "display_refresh_rate_hz": 120,
  "has_5g": true,
  "usb_type": "USB Type-C"
}

The important part is not the exact field list. The important part is that every comparison page reads from the same normalized table.

Playbook 2: Enrich ecommerce and affiliate content

Google's Merchant Center product data specification emphasizes accurate, well-formatted product information. For phone and electronics publishers, structured specs can help keep product pages consistent across titles, descriptions, comparison tables, and internal filters.

Use phone specification data to enrich:

  • product pages
  • phone collection pages
  • comparison widgets
  • "best phones by feature" pages
  • editorial buying guides
  • internal product matching tables

Do not treat the actor as a replacement for live merchant feeds. It is better used as reference data: model identity, technical specs, release context, and feature attributes. Inventory, offers, shipping, and availability should still come from commerce systems or merchant feeds.

Practical mapping:

Page elementSpec data that helps
Product titlebrand, model, storage/RAM variant when available
Short descriptiondisplay, chipset, camera, battery, release year
Filtersbrand, release year, OS, display size, battery, 5G, camera
Comparison tablenormalized display, CPU, RAM, battery, camera, dimensions
Internal linksbrand pages, year pages, feature pages

External reference:

Playbook 3: Analyze market movement by release year

Release-year search is useful when the question is about market movement instead of one manufacturer.

Questions you can answer:

  • Which brands released the most models in a given year?
  • How quickly did 5G move into mid-range devices?
  • Which display refresh rates became common by year?
  • How did average battery capacity shift over time?
  • Which markets appear most often in exported device records?

Workflow:

  1. Run one export per release year.
  2. Normalize dates into released_year, released_month, and announced_year.
  3. Create feature flags for chipset family, network generation, battery bands, display class, and charging power.
  4. Aggregate by year, brand, and feature.
  5. Store trend snapshots so reports can be reproduced later.

The best reports come from repeatable definitions. Decide exactly how you parse "5G," "AMOLED," "fast charging," and "flagship chipset" before building charts.

Playbook 4: Support app compatibility research

App teams often need device context. Google Play Console's device catalog helps developers review supported devices, understand device-level differences, and restrict app availability by device specs or compatibility signals.

Phones Specs DB is not a substitute for Play Console's official compatibility catalog. It can still help product and QA teams build a broader research layer around device characteristics.

Useful questions:

  • Which popular model families share a chipset or GPU?
  • Which devices have screen sizes or resolutions that need extra QA coverage?
  • Which older Android versions still appear in target device groups?
  • Which countries or markets appear for specific device variants?
  • Which device classes should be included in manual testing pools?

Workflow:

  1. Export devices by brand or year.
  2. Normalize OS, CPU, display, memory, and market fields.
  3. Group devices into QA buckets such as low-memory, high-refresh display, old OS, foldable/tablet-adjacent, or region-specific variants.
  4. Cross-check final compatibility decisions in official app platform tools.

External references:

Quality checks before publishing

Phone specs can be messy because model names, regional variants, and units vary. Add QA before the data reaches pages or dashboards.

Recommended checks:

CheckWhat to catch
Duplicate model checkSame model imported from multiple runs
Unit parsing check4500 mAh, 6.4 inch, 189 g, 67 W parsed correctly
Empty critical fieldsMissing brand, model, release year, OS, display, battery
Variant groupingStorage, RAM, region, and dual-SIM variants not collapsed incorrectly
Date sanityAnnounced and released years in expected range
Feature flag audit5G, NFC, Wi-Fi, USB, and camera flags match source text
Manual review listHigh-traffic devices and unusual records checked by a person

For publishing workflows, keep both raw and normalized values. Example: store nominal_battery_capacity_raw as returned and battery_mah as your parsed number. This makes debugging much easier when a parser rule fails.

Example automation architecture

A practical setup can stay simple:

LayerJob
Apify actor runCollect devices by brand or release year
Raw datasetPreserve source output exactly
Normalization jobParse units, clean names, build IDs, create feature flags
QA reportFlag missing fields, duplicate variants, parsing failures
DestinationCMS, database, Sheets, search index, BI tool, or API
Refresh scheduleMonthly, quarterly, or release-cycle based

Refresh cadence depends on the use case. A static reference database can update monthly. A publication workflow covering new phone launches may need weekly checks during busy release periods.

Common mistakes

Treating every variant as the same phone

Regional variants can differ by storage, RAM, SIM support, connectivity, market, or model number. Decide whether your product needs variant-level records or model-family records.

Mixing raw strings with numeric filters

Do not filter on raw values like "4500 mAh battery" or "161.54 mm 6.4 inch". Parse them into numeric fields first.

Publishing without source freshness notes

Specs databases are reference data. If your page or internal tool makes business decisions from the data, show last-updated dates and keep a refresh schedule.

Using specs data as live pricing data

Specs and price fields are useful context, but live commercial pricing should come from merchant feeds, retailer APIs, or commerce systems.

When this workflow is worth building

Build this automation if:

  • you manage more than a few dozen phone pages
  • you need brand or release-year coverage
  • you publish comparison tables or feature filters
  • you need repeatable device research
  • your team spends time copying specs by hand

Do not build it if:

  • your content team only publishes occasional one-off reviews
  • you need official compatibility decisions only
  • your main problem is live price or stock availability

FAQ

What is the best way to start a phone specs database?

Start with one brand or one release year, then normalize a small set of high-value fields: brand, model, release year, OS, chipset, display size, resolution, battery capacity, camera, and market. Review the output manually before scaling to more brands.

Should I search by brand or release year?

Use brand search when building manufacturer pages, comparison pages, or product catalogs. Use release-year search when analyzing market trends, release patterns, and feature adoption over time. Many teams use both, then dedupe records in a normalized table.

Can this replace Google Play Console's device catalog?

No. Use Play Console for official app compatibility and supported-device decisions. Use a broader phone specs database for research, QA planning, catalog enrichment, and market analysis around device characteristics.

What fields matter most for comparison pages?

The highest-value fields are brand, model, release year, operating system, chipset, RAM, storage, display size, resolution, battery capacity, charging power, camera details, connectivity, and dimensions. Keep raw fields too, because parsing rules improve over time.

Related pages

Next steps

  1. Run a small Phones Specs DB export for one brand or one release year.
  2. Save the raw dataset.
  3. Normalize 10-15 fields into a device table.
  4. Build one comparison, catalog, or QA report from that table.
  5. Add validation checks before expanding coverage.