Prune Xpath or Xpath Target for Spider Function #683
Closed
felipehertzer
started this conversation in
Ideas
Replies: 2 comments
-
Hi @felipehertzer, the main functions are They take a string (HTML) as input so you could pass the section you want to the link extraction function, you can also use Trafilatura's XPaths and then the extraction. Keep me updated, if you find a convenient way to do it we could work on a PR. See also this issue: #290. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Added feature on #684 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello @adbar,
I am currently testing the Spider function, but I encountered some difficulties in focusing the spider on a specific HTML container. Additionally, I'm exploring the possibility of excluding headers, footers, and other elements from the search. Do you think these adjustments could be feasible with the Spider function?
Thanks.
Beta Was this translation helpful? Give feedback.
All reactions