HTML Import Pro

What It Does

File Import - Upload HTML files individually or batch import via ZIP archives. Drag and drop support for quick imports. Process .html, .htm, and .xhtml files.

URL Import - Enter URLs and fetch content directly from the web. AJAX-based processing with real-time progress feedback. Automatic alias generation from source URLs.

Sitemap Import - Parse XML sitemaps to discover pages automatically. Filter URLs before import. Process hundreds of pages with progress tracking.

Smart Extraction - CSS selector-based content targeting or automatic detection using Readability-style algorithms. Strip unwanted elements like scripts, navigation, and sidebars.

Image Handling - Download images from source pages automatically. Save to your Joomla media folder. Optional WebP conversion for better performance.

Import Sources

HTML Files - Upload individual HTML files or batch import multiple files via ZIP archives. Supports .html, .htm, and .xhtml formats. Drag and drop for quick uploads.

Web URLs - Import content directly from any publicly accessible web page. Enter multiple URLs for batch processing. AJAX-powered with real-time status updates.

XML Sitemaps - Parse standard XML sitemaps to discover all pages on a website. Filter URLs by pattern. Upload local sitemap files or fetch from URL.
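
The sitemap step can be pictured with a short Python sketch. It is not the component's own code (the component is written in PHP); it only illustrates reading the `<loc>` entries of a standard sitemaps.org `<urlset>` and filtering them by pattern. The function name `sitemap_urls` and the use of glob-style patterns are assumptions made for illustration.

```python
import fnmatch
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org 0.9 schema.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(xml_text, pattern="*"):
    """Return the <loc> URLs of a <urlset> sitemap that match a glob pattern.

    Hypothetical helper: the real component may filter differently.
    """
    root = ET.fromstring(xml_text)
    urls = []
    for loc in root.findall("sm:url/sm:loc", SITEMAP_NS):
        url = (loc.text or "").strip()
        if url and fnmatch.fnmatchcase(url, pattern):
            urls.append(url)
    return urls


SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/blog/first-post</loc></url>
  <url><loc>https://example.com/about</loc></url>
  <url><loc>https://example.com/blog/second-post</loc></url>
</urlset>"""

# Keep only the blog pages before importing.
print(sitemap_urls(SAMPLE, "https://example.com/blog/*"))
```

Note that glob patterns treat `?` and `[` as wildcards, so URLs with query strings may need a different matching rule.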

Content Extraction

CSS Selectors - Define precise selectors to target content:
- Content selector (e.g., article, .post-content, #main)
- Title selector (e.g., h1, .entry-title)
- Elements to strip (e.g., nav, .sidebar, .comments)

Auto-Detection - When no selector is set, an algorithm finds the main content area automatically by analyzing page structure, text density, and HTML patterns, in the same way Readability does.
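
As a rough illustration of the text-density idea, the Python sketch below credits each element with the non-link text of its direct `<p>` children and picks the highest scorer. This is one simplified Readability-style heuristic chosen for the example, not the component's actual algorithm; `ParagraphScorer` and `find_main_content` are hypothetical names.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not go on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ParagraphScorer(HTMLParser):
    """Credit each element with the non-link text of its direct <p> children."""

    def __init__(self):
        super().__init__()
        self.stack = []    # open elements, outermost first: (tag, node_id)
        self.labels = {}   # node_id -> readable label such as "div#main"
        self.scores = {}   # node_id -> characters of non-link paragraph text

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        label = tag
        if attrs.get("id"):
            label += "#" + attrs["id"]
        elif classes:
            label += "." + classes[0]
        node_id = len(self.labels)
        self.labels[node_id] = label
        self.stack.append((tag, node_id))

    def handle_endtag(self, tag):
        # Pop back to the matching open tag; ignore stray closing tags.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        tags = [tag for tag, _ in self.stack]
        if "a" in tags or "p" not in tags:
            return  # link text and text outside paragraphs do not count
        p_index = len(tags) - 1 - tags[::-1].index("p")
        if p_index == 0:
            return  # a <p> with no parent element
        parent_id = self.stack[p_index - 1][1]
        self.scores[parent_id] = self.scores.get(parent_id, 0) + len(data.strip())


def find_main_content(html):
    """Return a label for the best-scoring container, or None if none scored."""
    scorer = ParagraphScorer()
    scorer.feed(html)
    scorer.close()
    if not scorer.scores:
        return None
    best = max(scorer.scores, key=scorer.scores.get)
    return scorer.labels[best]


PAGE = """
<html><body>
  <nav><p><a href="/">Home</a> <a href="/blog">Blog</a></p></nav>
  <div id="main">
    <h1>Moving to Joomla</h1>
    <p>This paragraph carries the body of the article and is fairly long.</p>
    <p>A second paragraph adds more text to the same container.</p>
  </div>
  <div class="sidebar"><p>Short teaser.</p></div>
</body></html>
"""

print(find_main_content(PAGE))
```

Ignoring link text is what keeps navigation blocks from winning: a menu is mostly anchors, while article text is mostly plain paragraphs.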

Metadata Extraction - Automatically captures:
- Page title from content or meta tags
- Meta description for SEO
- Open Graph data
- Publication dates when available
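
The metadata fields listed above map onto well-known HTML conventions: the `<title>` element, `<meta name="description">`, and Open Graph `og:*` properties. The Python sketch below shows one way to collect them. It assumes the publication date comes from the Open Graph `article:published_time` property, which may not be how the component finds dates, and the names `MetadataParser` and `extract_metadata` are illustrative only.

```python
from html.parser import HTMLParser

# Meta keys to keep in addition to any "og:*" property (an assumption).
WANTED = {"description", "article:published_time"}


class MetadataParser(HTMLParser):
    """Collect the page title and selected <meta> values."""

    def __init__(self):
        super().__init__()
        self.meta = {}
        self._in_title = False
        self._title_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attrs = dict(attrs)
            key = (attrs.get("property") or attrs.get("name") or "").lower()
            content = (attrs.get("content") or "").strip()
            if content and (key in WANTED or key.startswith("og:")):
                # Keep the first value seen for each key.
                self.meta.setdefault(key, content)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self._title_parts.append(data)


def extract_metadata(html):
    """Return a dict with the title, description, OG data, and date if present."""
    parser = MetadataParser()
    parser.feed(html)
    parser.close()
    title = "".join(parser._title_parts).strip()
    if title:
        parser.meta["title"] = title
    return parser.meta


PAGE = """<html><head>
<title>Moving to Joomla</title>
<meta name="description" content="How we migrated a static site.">
<meta property="og:title" content="Moving to Joomla">
<meta property="og:image" content="https://example.com/cover.jpg">
<meta property="article:published_time" content="2024-05-01T09:00:00Z">
</head><body></body></html>"""

for key, value in sorted(extract_metadata(PAGE).items()):
    print(f"{key}: {value}")
```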

Image Features

Automatic Download - Images referenced in imported content are downloaded and saved locally. External image URLs are replaced with local paths.

WebP Conversion - Optional conversion to WebP format:
- Smaller file sizes (25-35% reduction)
- Better page performance
- Automatic fallback if conversion fails

Organized Storage - Images are saved in structured folders named after the article alias for easy management.
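
To make the alias-based folder layout concrete, here is a small Python sketch that derives an alias from a source URL and builds the local path for a downloaded image. The exact alias rules and the base folder `images/imported` are assumptions for illustration; Joomla and the component may normalize aliases and choose folders differently.

```python
import posixpath
import re
from urllib.parse import unquote, urlsplit


def alias_from_url(url):
    """Derive a lowercase, hyphenated alias from the last path segment of a URL."""
    parts = urlsplit(url)
    path = unquote(parts.path).rstrip("/")
    # Fall back to the host name when the URL has no path segment.
    segment = posixpath.basename(path) or parts.netloc
    segment = re.sub(r"\.(html?|xhtml|php)$", "", segment, flags=re.IGNORECASE)
    alias = re.sub(r"[^a-z0-9]+", "-", segment.lower()).strip("-")
    return alias or "imported-page"


def local_image_path(image_url, alias, base="images/imported"):
    """Build the local path for an image, grouped under the article alias.

    The default base folder is hypothetical.
    """
    filename = posixpath.basename(urlsplit(image_url).path) or "image"
    return posixpath.join(base, alias, filename)


alias = alias_from_url("https://example.com/blog/My_First%20Post.html")
print(alias)
print(local_image_path("https://cdn.example.com/img/photo.jpg?w=800", alias))
```

Dropping the query string from the image URL keeps file names stable when the source site serves resized variants of the same image.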

Key Features

  • 3 Import Sources - Files, URLs, Sitemaps
  • Batch Import - ZIP archives and sitemap parsing
  • AJAX Processing - Real-time progress and status
  • CSS Selectors - Precise content targeting
  • Auto-Detection - Smart content extraction
  • Image Download - Automatic with path updates
  • WebP Conversion - Optional image optimization
  • Import Profiles - Save and reuse configurations
  • Alias Generation - Automatic from source URLs
  • Metadata Extraction - Titles, descriptions, OG data
  • Strip Elements - Remove nav, ads, sidebars
  • Import Logging - Track all operations
  • CLI Support - Command line automation
  • Rate Limiting - Respectful URL fetching
  • Joomla 4, 5 & 6 - Full compatibility

Perfect For

  • Website Migration - Move content from old HTML sites to Joomla
  • Content Aggregation - Import articles from multiple sources
  • Blog Migration - Bring posts from other platforms
  • Documentation Import - Convert HTML docs to articles
  • Archive Creation - Save web pages as Joomla content
  • Content Backup - Import external content for preservation
  • Site Redesign - Migrate content during redesign projects
  • Bulk Content Creation - Populate sites with existing content

Import Profiles

Save your import configurations for repeated use:

  • Content extraction selectors
  • Title extraction settings
  • Elements to strip
  • Target category
  • Article state and access
  • Image import preferences
  • WebP conversion setting

Load profiles during import or save current settings as new profiles. Perfect for importing from multiple source websites with different structures.

AJAX-Powered Processing

URL and sitemap imports feature real-time feedback:

  • Progress Bar - Visual completion percentage
  • Current URL - Shows which page is being processed
  • Status Indicators - Pending, fetching, completed, error
  • Continue on Error - Keeps processing if individual URLs fail
  • Configurable Delay - Set interval between requests
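
The behavior in the list above (per-URL status, a completion percentage, continue on error, and a delay between requests) can be sketched as a simple loop. This Python example is an illustration under assumptions, not the component's AJAX implementation: `import_batch` is a hypothetical name, and the fetch function is passed in so the sketch runs without network access.

```python
import time


def import_batch(urls, fetch, delay=1.0, sleep=time.sleep):
    """Process URLs one by one, reporting progress and continuing on errors."""
    results = []
    for position, url in enumerate(urls, start=1):
        try:
            fetch(url)
            status = "completed"
        except Exception as exc:
            # Record the failure and move on to the next URL.
            status = f"error: {exc}"
        results.append((url, status))
        percent = round(100 * position / len(urls))
        print(f"[{percent:3d}%] {url} -> {status}")
        if position < len(urls):
            sleep(delay)  # pause between requests to limit load on the source
    return results


def fake_fetch(url):
    """Stand-in for a real HTTP request, used only for this demonstration."""
    if "broken" in url:
        raise ValueError("HTTP 404")
    return "<html></html>"


results = import_batch(
    ["https://example.com/a", "https://example.com/broken", "https://example.com/c"],
    fake_fetch,
    delay=0,
)
```

Passing `sleep` as a parameter keeps the loop easy to test, since a test can replace it with a function that records the requested delays instead of waiting.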

Configuration Options

Basic Settings
- Default category for imports
- Default author assignment
- Default access level
- Default publishing state

Extraction Settings
- Content CSS selectors
- Title CSS selectors
- Elements to strip/remove
- Auto-detection toggle

Image Settings
- Enable/disable image import
- Destination folder
- WebP conversion toggle

URL Settings
- User agent string
- Request timeout
- Rate limit delay
- robots.txt respect
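
As an illustration of what respecting robots.txt involves, the Python sketch below uses the standard library's `urllib.robotparser` to filter a URL list against a site's rules and to read its crawl delay. The user agent name `HtmlImportBot` and the helper `allowed_urls` are made up for the example, and the component's own check may work differently.

```python
from urllib.robotparser import RobotFileParser


def allowed_urls(robots_txt, user_agent, urls):
    """Return only the URLs that the given robots.txt text allows for this agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if parser.can_fetch(user_agent, url)]


ROBOTS = """User-agent: *
Disallow: /private/
Crawl-delay: 2
"""

CANDIDATES = [
    "https://example.com/blog/post",
    "https://example.com/private/draft",
]

# "HtmlImportBot" is a hypothetical user agent string.
print(allowed_urls(ROBOTS, "HtmlImportBot", CANDIDATES))

# A site's Crawl-delay can inform the rate limit delay setting.
parser = RobotFileParser()
parser.parse(ROBOTS.splitlines())
print(parser.crawl_delay("HtmlImportBot"))
```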

Permissions
- Configure - Component options
- Access - Basic component access
- Import - Permission to import
- Profiles - Manage import profiles
- Logs - View import history

Technical Features

Modern Architecture
- Joomla 4, 5 & 6 native compatibility
- PHP 8.1+ with type declarations
- Proper namespace implementation
- MVC architecture
- Service-based design

Robust Processing
- DOMDocument HTML parsing
- XPath content extraction
- cURL-based URL fetching
- GD-based image processing
- Batch processing support

Security
- CSRF token validation
- Permission checks
- Input sanitization
- Secure file handling
- Firewall-friendly design

Requirements

  • Joomla 4.0 or later (4.x, 5.x, 6.x)
  • PHP 8.1 or later
  • PHP Extensions: cURL, DOM, libxml
  • PHP GD with WebP support (for conversion)
  • MySQL 5.7+ / MariaDB 10.3+

Support

  • Support Portal: https://support.joomlax.com
  • Email: support@joomlax.com
  • Website: https://www.joomlax.com
  • Documentation: Comprehensive docs included

Why Choose HTML Import Pro?

Multiple Sources - Import from files, URLs, or sitemaps in one component

Smart Extraction - CSS selectors and auto-detection find the right content

Real-Time Feedback - AJAX processing shows progress for every URL

Image Handling - Automatic download and optional WebP conversion

Reusable Profiles - Save configurations for different source websites

Full Logging - Track every import with detailed history

Joomla Native - Built specifically for Joomla following all standards

Migrate your HTML content to Joomla with intelligent extraction, automatic image handling, and powerful batch processing. Perfect for website migrations, content aggregation, and bulk imports.

Joomla 4, 5 & 6 Compatible | 3 Import Sources | Smart Content Extraction | WebP Conversion

Extension Info

HTML Import Pro is a powerful Joomla component for importing HTML content into Joomla articles. Import from local files, fetch from URLs, or parse entire sitemaps for bulk content migration. Intelligent extraction algorithms find the right content while filtering out navigation, ads, and other noise.

Extension Data

  • Latest Version: 1.0
  • Developer: Infyways Solutions
  • Last Updated: 2026-02-08
  • Date Published: 2026-01-25
  • Type: Paid download
  • Compatibility:
    • Joomla 3.x: Yes
    • Joomla 4.x: Yes
    • Joomla 5.x: Yes
    • Joomla 6.x: Yes
