Question: I use readHTMLTable() from the XML package to scrape a table from a web page (http://www.gaokao.com/e/20201106/5fa4f625338d0.shtml). The table is unusual: the author appears to have merged some cells. As a result, the scraped output is weird, as you can see in the picture. What should I do? I heard there are a lot of experts in this community, so I came here. Thanks a lot!
Here is my code:
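library(RCurl)  # provides getURL()
library(XML)    # provides htmlParse() and readHTMLTable()
url <- "http://www.gaokao.com/e/20201106/5fa4f625338d0.shtml"
temp <- getURL(url, httpheader = myHttpheader, .encoding = "GB2312")  # myHttpheader: my request headers, defined elsewhere (not shown)
doc <- htmlParse(temp, asText = TRUE, encoding = "UTF-8")  # parse the downloaded text (was temp1, which is never defined)
table1 <- readHTMLTable(doc, header = TRUE, which = 1)  # first table on the page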
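Merged cells in HTML are normally written with rowspan/colspan attributes, and that is probably why the columns end up shifted. Below is a minimal sketch of one way to handle it: walk the rows yourself and copy each merged cell's text into every slot it spans. The helper name expand_table and the small inline table are made up for illustration. I am assuming the real page uses plain rowspan/colspan, which I have not checked.

```r
library(XML)

# Hypothetical sample table with one vertically merged cell (rowspan = 2).
html <- '<table>
<tr><th>Province</th><th>Batch</th><th>Score</th></tr>
<tr><td rowspan="2">A</td><td>First</td><td>500</td></tr>
<tr><td>Second</td><td>400</td></tr>
</table>'

# Expand a <table> node into a character matrix, repeating the text of a
# merged cell in every row/column slot that it spans.
expand_table <- function(table_node) {
  rows <- getNodeSet(table_node, ".//tr")
  n_rows <- length(rows)
  grid <- matrix(NA_character_, nrow = n_rows, ncol = 0)

  # Read a rowspan/colspan attribute, falling back to 1 if missing or invalid.
  span <- function(cell, name) {
    n <- suppressWarnings(as.integer(xmlGetAttr(cell, name, default = "1")))
    if (is.na(n) || n < 1L) 1L else n
  }

  for (r in seq_len(n_rows)) {
    cells <- getNodeSet(rows[[r]], "./th | ./td")
    col <- 1L
    for (i in seq_along(cells)) {
      cell <- cells[[i]]
      # Skip slots already filled by a rowspan coming from a row above.
      while (col <= ncol(grid) && !is.na(grid[r, col])) col <- col + 1L
      last_col <- col + span(cell, "colspan") - 1L
      last_row <- min(r + span(cell, "rowspan") - 1L, n_rows)
      # Grow the matrix to the right when this cell needs more columns.
      if (last_col > ncol(grid)) {
        grid <- cbind(grid, matrix(NA_character_, n_rows, last_col - ncol(grid)))
      }
      txt <- xmlValue(cell)
      if (length(txt) == 0 || is.na(txt)) txt <- ""
      grid[r:last_row, col:last_col] <- trimws(txt)
      col <- last_col + 1L
    }
  }
  grid
}

doc <- htmlParse(html, asText = TRUE)
tbl <- getNodeSet(doc, "//table")[[1]]
grid <- expand_table(tbl)

# Treat the first row as the header and the rest as data.
df <- as.data.frame(grid[-1, , drop = FALSE], stringsAsFactors = FALSE)
names(df) <- grid[1, ]
print(df)
```

For the real page, the same function could be applied to the first table node of the doc parsed above instead of the sample HTML. Note that the ".//tr" path would also pick up rows of any nested tables, so this sketch only suits simple, non-nested tables.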