Step 1: Automatic search
For each butterfly species in the checklist, we ran a preliminary automated search in BHL for pages that mention the species.
Step 2: Keyword matching
Then, we used our keyword list to select the pages with information about biotic associations, resulting in XXXX pages.
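The keyword filter described above can be sketched as follows. This is a minimal illustration, not the routine used in the study: the keyword list is a hypothetical subset, the function names `match_keywords` and `select_pages` are invented here, and whole-word, case-insensitive matching is an assumption about how matches were defined.

```python
import re

# Hypothetical subset of the keyword list (assumption for illustration only).
KEYWORDS = ["host", "host plant", "feed on", "larvae"]


def match_keywords(page_text, keywords=KEYWORDS):
    """Return the keywords found in page_text, in keyword-list order.

    Matching is case-insensitive and anchored on word boundaries,
    so "host" does not match inside "hosts".
    """
    text = page_text.lower()
    matched = []
    for kw in keywords:
        pattern = r"\b" + re.escape(kw.lower()) + r"\b"
        if re.search(pattern, text):
            matched.append(kw)
    return matched


def select_pages(pages, keywords=KEYWORDS):
    """Keep only the pages with at least one keyword match.

    pages maps a page identifier to its text; the result maps each
    selected page identifier to the list of keywords it matched.
    """
    selected = {}
    for page_id, text in pages.items():
        hits = match_keywords(text, keywords)
        if hits:
            selected[page_id] = hits
    return selected
```

Anchoring on word boundaries is one possible design choice; it avoids counting substrings as matches but also misses inflected forms such as plurals, which a real keyword list would need to include explicitly.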
Step 3: Automatic download
Next, we implemented a routine to automatically download the selected pages in PDF format, and built a file with the following information:
| Column name | Description | Example of content |
| --- | --- | --- |
| arch | Archive name | XXX.pdf |
| code | Bibliography code in our database | Kaye1913 |
| nps | Page number | 230 |
| kws | Keywords matched | host :: host plant :: feed on :: larvae |
| val | Scientific name from the checklist | Philotiella leona |
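
A file with the columns listed above could be assembled along these lines. This is a sketch under stated assumptions: the source does not specify the file format, so CSV is assumed here, and the helper names `make_row` and `write_index` are hypothetical. The only details taken from the text are the five column names and the ` :: ` separator between matched keywords.

```python
import csv
import io

# Column names as described in the text.
COLUMNS = ["arch", "code", "nps", "kws", "val"]


def make_row(arch, code, page, keywords, name):
    """Build one record; matched keywords are joined with ' :: '."""
    return {
        "arch": arch,
        "code": code,
        "nps": page,
        "kws": " :: ".join(keywords),
        "val": name,
    }


def write_index(rows, handle):
    """Write the records to an open text handle as CSV (assumed format)."""
    writer = csv.DictWriter(handle, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)


# Usage with the example values from the column description.
row = make_row(
    "XXX.pdf",
    "Kaye1913",
    230,
    ["host", "host plant", "feed on", "larvae"],
    "Philotiella leona",
)
buffer = io.StringIO()
write_index([row], buffer)
```

Writing to a generic text handle keeps the sketch testable in memory; passing a file opened with `open(path, "w", newline="")` would produce the file on disk.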