File name: text_extractor_old
Description: all downloaded content comes from the internet; please study and use it at your own discretion
A crawler for BBS-type (forum) websites. It works generically across typical BBS sites and saves the crawled data as txt files.
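The archive's own code is not shown on this page, so the following is only a minimal sketch of how such a crawler could work, not the author's implementation. It assumes that `input/bbs_urls.txt` holds one URL per line and that each page's text is written to `output/output-N.txt`, which matches the file names in the listing. The extraction step is a simplified line-block density heuristic, chosen as a guess based on the file name `cx_extractor_Python.py`. All function names, `block_width`, and `threshold` are hypothetical.

```python
import re
import urllib.request
from pathlib import Path


def fetch(url, timeout=10):
    """Download a page and decode it with the declared charset (UTF-8 if none)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def strip_tags(html):
    """Drop scripts, styles, comments, and tags; return one stripped string per line."""
    html = re.sub(r"(?is)<(script|style)\b.*?</\1>", "", html)
    html = re.sub(r"(?s)<!--.*?-->", "", html)
    text = re.sub(r"(?s)<[^>]+>", "", html)
    return [line.strip() for line in text.splitlines()]


def extract_main_text(html, block_width=3, threshold=20):
    """Return the densest run of text lines (hypothetical, simplified heuristic)."""
    lines = strip_tags(html)
    n = len(lines)
    if n == 0:
        return ""
    # blocks[i] = number of characters in lines i .. i + block_width - 1
    blocks = [sum(len(l) for l in lines[i:i + block_width]) for i in range(n)]
    best_total, best_start, best_end = 0, 0, 0
    i = 0
    while i < n:
        if blocks[i] > threshold:
            j = i
            while j < n and blocks[j] > threshold:
                j += 1
            # Lines i .. j-1 each start a dense block; keep the heaviest run.
            total = sum(len(l) for l in lines[i:j])
            if total > best_total:
                best_total, best_start, best_end = total, i, j
            i = j
        else:
            i += 1
    return "\n".join(l for l in lines[best_start:best_end] if l)


def save_outputs(texts, out_dir="output"):
    """Write each text to out_dir/output-N.txt and return the created paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, text in enumerate(texts):
        path = out / f"output-{i}.txt"
        path.write_text(text, encoding="utf-8")
        paths.append(path)
    return paths


def crawl(url_file="input/bbs_urls.txt", out_dir="output"):
    """Assumed workflow: read one URL per line, extract the text, save as txt."""
    raw = Path(url_file).read_text(encoding="utf-8")
    urls = [u.strip() for u in raw.splitlines() if u.strip()]
    return save_outputs([extract_main_text(fetch(u)) for u in urls], out_dir)
```

Calling `crawl()` from the project root would, under these assumptions, fetch every URL in `input/bbs_urls.txt` and write one txt file per page. Because the heuristic only measures text density and relies on no site-specific selectors, the same code can run against different BBS layouts, though `block_width` and `threshold` would likely need tuning per site.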
Download file list
text_extractor_old
..................\.DS_Store
__MACOSX
........\text_extractor_old
........\..................\._.DS_Store
text_extractor_old\.git
..................\....\.DS_Store
__MACOSX\text_extractor_old\.git
........\..................\....\._.DS_Store
text_extractor_old\.git\config
..................\....\description
..................\....\FETCH_HEAD
..................\....\HEAD
..................\....\hooks
..................\....\.....\applypatch-msg.sample
..................\....\.....\commit-msg.sample
..................\....\.....\post-update.sample
..................\....\.....\pre-applypatch.sample
..................\....\.....\pre-commit.sample
..................\....\.....\pre-push.sample
..................\....\.....\pre-rebase.sample
..................\....\.....\pre-receive.sample
..................\....\.....\prepare-commit-msg.sample
..................\....\.....\update.sample
..................\....\index
..................\....\info
..................\....\....\exclude
..................\....\logs
..................\....\....\HEAD
..................\....\....\refs
..................\....\....\....\heads
..................\....\....\....\.....\master
..................\....\objects
..................\....\.......\56
..................\....\.......\..\9b205ceef5a20d932afa2a3c415b8a00feee0a
..................\....\.......\f3
..................\....\.......\..\f51e8836a8eecd6d45bc33d4867f68a5f39998
..................\....\.......\info
..................\....\.......\pack
..................\....\.......\....\pack-093660246cd290f56a9900f79b5e1fc56237636e.idx
..................\....\.......\....\pack-093660246cd290f56a9900f79b5e1fc56237636e.pack
..................\....\refs
..................\....\....\heads
..................\....\....\.....\master
..................\....\....\tags
..................\.gitattributes
..................\.gitignore
..................\.idea
..................\.....\encodings.xml
..................\.....\misc.xml
..................\.....\modules.xml
..................\.....\text_extractor.iml
..................\.....\vcs.xml
..................\.....\workspace.xml
..................\.vscode
..................\.......\.browse.VC.db
..................\.......\launch.json
..................\__pycache__
..................\...........\cx_extractor_Python.cpython-33.pyc
..................\cx_extractor_Python.py
..................\cx_extractor_Python.pyc
..................\img
..................\...\1.png
..................\...\2.png
..................\...\raw.png
..................\...\text.png
..................\input
..................\.....\bbs_urls.txt
..................\output
..................\......\output-0.txt
..................\......\output-1.txt
..................\......\output-10.txt
..................\......\output-100.txt
..................\......\output-101.txt
..................\......\output-103.txt
..................\......\output-104.txt
..................\......\output-105.txt
..................\......\output-106.txt
..................\......\output-107.txt
..................\......\output-108.txt
..................\......\output-109.txt
..................\......\output-11.txt
..................\......\output-110.txt
..................\......\output-111.txt
..................\......\output-112.txt
..................\......\output-113.txt
..................\......\output-114.txt
..................\......\output-115.txt
..................\......\output-116.txt
..................\......\output-117.txt
..................\......\output-118.txt
..................\......\output-119.txt
..................\......\output-12.txt
..................\......\output-120.txt
..................\......\output-121.txt
..................\......\output-122.txt
..................\......\output-123.txt
..................\......\output-126.txt
..................\......\output-127.txt
..................\......\output-128.txt