The fact that the page was HTML is getting lost and the lack of .html extention as a hint causes it to treat the page as text. Adding an additional keep tags pair of
<head> and </head>
should work nicely.
My post looks how I intended (besides the mispelling of extension).
Add a "Keep Tag" to grab some useful html that won't include undesired text.
value for Begin: <head>
Value for End: </head>