Package org.codehaus.xsite.extractors
Class SiteMeshPageExtractor
- java.lang.Object
-
- org.codehaus.xsite.extractors.SiteMeshPageExtractor
-
- All Implemented Interfaces:
PageExtractor
public class SiteMeshPageExtractor extends Object implements PageExtractor
PageExtractor which extract page information from an HTML file using the SiteMesh library.- Author:
- Joe Walnes, Jörg Schaible
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classSiteMeshPageExtractor.CannotParsePageException
-
Constructor Summary
Constructors Constructor Description SiteMeshPageExtractor()SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem)SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem, CharacterEscaper characterEscaper)SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem, CharacterEscaper characterEscaper, AttributedPageBuilder pageBuilder)SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, FileSystem fileSystem)SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, FileSystem fileSystem, CharacterEscaper characterEscaper)SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, FileSystem fileSystem, CharacterEscaper characterEscaper, AttributedPageBuilder pageBuilder)SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem)SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem, CharacterEscaper characterEscaper)SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem, CharacterEscaper characterEscaper, AttributedPageBuilder pageBuilder)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description PageextractPage(File htmlFile)PageextractPage(String filename, String htmlContent)
-
-
-
Constructor Detail
-
SiteMeshPageExtractor
public SiteMeshPageExtractor()
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, FileSystem fileSystem)
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem)
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem)
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, FileSystem fileSystem, CharacterEscaper characterEscaper)
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem, CharacterEscaper characterEscaper)
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem, CharacterEscaper characterEscaper)
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, FileSystem fileSystem, CharacterEscaper characterEscaper, AttributedPageBuilder pageBuilder)
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem, CharacterEscaper characterEscaper, AttributedPageBuilder pageBuilder)
-
SiteMeshPageExtractor
public SiteMeshPageExtractor(com.opensymphony.module.sitemesh.html.TagRule[] rules, com.opensymphony.module.sitemesh.html.TextFilter[] filter, FileSystem fileSystem, CharacterEscaper characterEscaper, AttributedPageBuilder pageBuilder)
-
-
Method Detail
-
extractPage
public Page extractPage(File htmlFile)
- Specified by:
extractPagein interfacePageExtractor
-
extractPage
public Page extractPage(String filename, String htmlContent)
- Specified by:
extractPagein interfacePageExtractor
-
-