Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
Types ¶
type HreflangEntry ¶
HreflangEntry represents a hreflang link.
type Link ¶
type Link struct {
TargetURL string
AnchorText string
Rel string
IsInternal bool
Tag string // "a", "link", "area", etc.
}
Link represents an extracted link from a page.
type PageData ¶
type PageData struct {
Title string
Canonical string
MetaRobots string
MetaDescription string
MetaKeywords string
H1 []string
H2 []string
H3 []string
H4 []string
H5 []string
H6 []string
Links []Link
Images []Image
Hreflang []HreflangEntry
Lang string
OGTitle string
OGDescription string
OGImage string
SchemaTypes []string
WordCount int
ContentHash uint64 // SimHash fingerprint of visible body text
Resources []PageResource
}
PageData holds all extracted SEO signals from a page.
type PageResource ¶
type PageResource struct {
URL string
ResourceType string // "css", "js", "font", "icon"
IsInternal bool
}
PageResource represents a CSS, JS, font, or icon resource referenced by a page.
func ExtractResources ¶
func ExtractResources(doc *goquery.Document, baseURL *url.URL) []PageResource
ExtractResources extracts external resource references (CSS, JS, fonts, icons) from the document.
Click to show internal directories.
Click to hide internal directories.