您现在的位置是：首页 > 后端

当前栏目

scrapy/nginx 服务带有http 验证，怎样使用curl 请求详解程序员

scrapy 服务 Nginx HTTP 程序员使用详解验证

2023-06-13 09:19:55 时间

爬虫服务免不了需要定时启动,就需要crontab+curl 来触发,但是curl 怎样发送带验证的请求.

有些网域需要 HTTP 认证，这时 curl 需要用到 --user 或者 -u 参数。 

$ curl --user name:password example.com

如果不使用验证直接请求网站会有如下提示:

 html 

 head title 401 Authorization Required /title /head 

 body bgcolor="white" 

 center h1 401 Authorization Required /h1 /center 

 hr center nginx/1.14.0 (Ubuntu) /center 

 /body 

 /html

使用验证了以后就没有问题:

curl --user user:password abc.com:6800

 html 

 head title Scrapyd /title /head 

 body 

 h1 Scrapyd /h1 

 p Available projects: b scrapy_rere /b /p 

 li a href="/jobs" Jobs /a /li 

 li a href="/items/" Items /a /li 

 li a href="/logs/" Logs /a /li 

 li a href="http://scrapyd.readthedocs.org/en/latest/" Documentation /a /li 

 /ul 

 h2 How to schedule a spider? /h2 

 p To schedule a spider you need to use the API (this web UI is only for 

monitoring) /p 

 p Example using a href="http://curl.haxx.se/" curl /a : /p 

 p code curl http://localhost:6800/schedule.json -d project=default -d spider=somespider /code /p 

 p For more information about the API, see the a href="http://scrapyd.readthedocs.org/en/latest/" Scrapyd documentation /a /p 

 /body 

 /html

补充说明:

我的scrapy 服务是使用docker搭建的,docker中又使用nginx 代理验证,

原创文章，作者：ItWorker，如若转载，请注明出处：https://blog.ytso.com/1578.html

服务器部署程序员系统优化网站设置运维

猜你喜欢

Linux下批量压缩文件的简便方法（linux批量压缩）
Fuzzable：一款基于静态分析实现的可模糊测试的自动化目标识别工具
图扑智慧城市 | 搭建政务民生可视化管理系统
SQL Server拉取大量数据：不再是难事！（sqlserver拉数据）
html中ul和li的使用_ul列表的html结构
谷歌发表论文破解围棋难题，Facebook却说「是我们先做到的」
怎样解决MySQL下载速度缓慢的问题（mysql下载还在下载）
Linux进程是如何创建出来的？
Linux 系统故障排查，怕了怕了！｜极客时间
删除或修改本地Git账号密码详解编程语言
Linux Shell 文本处理工具集锦-Grep+xargs
Oracle：从字符到数字的转换（oracle字符转数字）
技术硬实力，聊聊写Spring Cloud Alibaba实战派这本书的初衷
BBR vs BBRplus vs BBR2 劣质网络速度对比
如何在Linux中进入DOS系统（linux怎么进入dos）
【MySQL事务隔离级别及其应用】（mysql的事物隔离级别）
官方将设定网约车平台抽成比例上限：要求降低比例并向社会公布
PostgreSQL 数据库性能提升的几个方面
winform中的提示框+MSN提示封装，原生的也不错

zl程序教程

当前栏目

scrapy/nginx 服务带有http 验证，怎样使用curl 请求详解程序员

相关文章