dapengde
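You were right, the spaces were to blame. The pattern src = " and the argument sep = " " each contained an extra space, and that caused all kinds of errors. Here is the error that sep = " " produced: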
curl: (23) Failed writing body (0 != 6655)
Error in download.file(url = bb[i], destfile = paste("c:/r4r/", stname[i], :
'curl' call had nonzero exit status
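With the spaces removed from both places, I can do pretty much whatever I like with the code, as long as I don't get too reckless! Apparently spaces are not something I can sprinkle in on a whim [laughing through tears]
Thank you for the patient explanation!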
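For anyone who hits the same thing, here is a minimal sketch of how the two stray spaces change the result. The folder name "ab12" and the example.com line are made up for illustration; only the paste() and grepl() behaviour is the point.

```r
# Hypothetical folder name, not taken from the real page
folder <- "ab12"

# paste() inserts `sep` between every piece, so sep = " " puts spaces in the path
with_space <- paste("c:/r4r/", folder, "/", folder, 1, ".jpg", sep = " ")
no_space   <- paste("c:/r4r/", folder, "/", folder, 1, ".jpg", sep = "")

print(with_space)  # "c:/r4r/ ab12 / ab12 1 .jpg"
print(no_space)    # "c:/r4r/ab12/ab121.jpg"

# A search pattern with extra spaces no longer matches the HTML line
html_line <- '<img src="http://example.com/a.jpg">'
print(grepl('src="http', html_line, fixed = TRUE))    # TRUE
print(grepl('src = "http', html_line, fixed = TRUE))  # FALSE
```

paste0() is shorthand for paste(..., sep = ""), which avoids having to remember the separator at all.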
[1] "88 of 597 downloaded."
curl: (3) URL using bad/illegal format or missing URL
Error in download.file(url = bb[i], destfile = paste("c:/r4r/pic/", stname[i], :
'curl' call had nonzero exit status
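An unexpected turn: partway through, the download failed again. When the third download loop started, curl suddenly could not reach the link! I thought I was finally in the clear. The only change I made in between was creating a new folder, pic. Could there be an extra space somewhere again? The source code is below: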
urlink <- 'http://www.biomet.co.at/pictures/'
aa <- readLines(urlink, encoding = 'UTF-8')
linkformat <- 'src="http://www.biomet.co.at/wp/wp-content/gallery'
bb <- aa[grep(linkformat, aa)]
for(i in 1:length(bb))
  bb[i] <- substring(
    bb[i],
    regexpr("http", bb[i])[1],
    regexpr(".jpg\"", bb[i])[1]+3)
bb <- unique(bb)
length(bb)
dir.create('c:/r4r/pic/')
writeLines(bb, 'c:/r4r/pic/links.txt')
stname <- substring(bb, 47, 50)
stname <- stname[-which(stname == '')]
for(i in unique(stname))
  dir.create(paste('c:/r4r/pic/', i, sep = ''))
for(i in 1:length(bb)) {
  download.file(
    url = bb[i],
    destfile = paste(
      'c:/r4r/pic/', stname[i],'/', stname[i], i, '.jpg',
      sep = ""),
    method = 'curl', quiet = TRUE)
  print(paste(i, 'of', length(bb), 'downloaded.'))
}
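One thing in the code above that might be worth checking, although I have not confirmed it is the cause: stname is shortened with -which(...) while bb keeps its full length, so stname[i] and bb[i] may no longer refer to the same picture. A small sketch with placeholder values (the strings "u1", "u2", "u3" stand in for real URLs):

```r
# Placeholder data: the second entry has an empty station name
stname <- c("ab12", "", "cd34")
bb     <- c("u1", "u2", "u3")

# Dropping from stname only: stname is now shorter than bb,
# so position 2 of `shorter` belongs to bb[3], not bb[2]
shorter <- stname[-which(stname == "")]
print(length(shorter))  # 2, while length(bb) is still 3

# If nothing matches, which() returns integer(0) and the result is empty
all_ok  <- c("ab12", "cd34")
emptied <- all_ok[-which(all_ok == "")]
print(emptied)          # character(0)

# A logical filter applied to both vectors keeps them paired
keep    <- stname != ""
bb2     <- bb[keep]
stname2 <- stname[keep]
print(data.frame(bb2, stname2))
```

Filtering both vectors with the same logical index is one way to keep the URL and its folder name in step.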
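I reverted the code to the original folder path and ran it again, and it still fails right after the 88th photo is downloaded! So the new folder is evidently not the cause either. I have also re-checked the code, and there should be no more carelessly added spaces.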
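Since curl error (3) complains about the URL itself, one way to narrow this down is to look at the entries in bb before downloading, starting with bb[89]. Below is a rough sketch, not a confirmed fix: suspicious() and download_all() are hypothetical helper names, the checking rules are my own assumptions about what a valid link looks like here, and the demo vector is made up.

```r
# Flag entries that curl is likely to reject (assumed rules)
suspicious <- function(urls) {
  is.na(urls) |
    !grepl("^https?://", urls) |                 # does not start with http(s)://
    grepl("[[:space:]]", urls) |                 # contains whitespace
    !grepl("\\.jpg$", urls, ignore.case = TRUE)  # does not end in .jpg
}

# Download everything, skip failures, and return the indices that failed.
# `fetch` is a stand-in; with real data it could be
# function(u, f) download.file(u, f, method = "curl", quiet = TRUE)
download_all <- function(urls, files, fetch) {
  failed <- integer(0)
  for (i in seq_along(urls)) {
    ok <- tryCatch({ fetch(urls[i], files[i]); TRUE },
                   error = function(e) FALSE)
    if (!ok) failed <- c(failed, i)
  }
  failed
}

# Made-up examples, not taken from the real page
demo <- c("http://example.com/a.jpg",
          "http://example.com/b c.jpg",
          "",
          "http://example.com/d.jpg\" alt=")
bad <- which(suspicious(demo))
print(bad)  # 2 3 4

# Offline stand-in for the downloader: fails on suspicious URLs
fake_fetch <- function(u, f) if (suspicious(u)) stop("bad url") else invisible(0)
failed <- download_all(demo, paste0("f", seq_along(demo)), fake_fetch)
print(failed)  # 2 3 4
```

With the real data, which(suspicious(bb)) would list the positions to inspect, and the tryCatch() loop lets the remaining pictures download even if a few links are broken.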