Repetitively Retrieve Data from Site via PHP

2022-12-11 08:24 问答作者：

When accessing http://www.example.net, a CSV file is downloaded with the most current data regarding that site. I want to have my site, http:开发者_JS百科//www.example.com, access http://www.example.net on an hour by hour basis in order to get updated information.

I want to then use the updated information stored in the CSV file to compare changes from data in previous CSV files. I obviously have no idea what the best plan of attack would be so any help would be appreciated. I am just looking for a general outline of how I should proceed, but the more information the better.

By the way, I'm using a LAMP bundle so PHP and mySQL solutions are preferred.

I think the most easy way for you to handle this would be to have a cron job running every hour (or scheduled task if are on windows), downloading the CSV with curl or file_get_contents(manual). When you have downloaded the CSV you can import new data in your MySQL database.

The CSV should have some kind of timestamp on every row so you can easily separate new and old data.

Also handling XML would be better then plain CSV.

A better way to setup that would be you to create a webservice on http://www.example.com and update in real time from your http://www.example.net. But it requires you to have access to both websites.

Depending on the OS you're using, you're looking at a scheduled task (Windows) or a cron job (*nix) to kick up a service/app that would pull the new CSV and compare it to an older copy.

You'll definitely want to go the route of a cron job. I'm not exactly sure what you want to do with the differences, however, if you just want an email, here is one potential (and simplified) option:

wget http://uri.com/file.txt && diff file.txt file_previous.txt | mail -s "Differences" your@email.com && mv file.txt file_previous.txt

Try this command by itself from your command line (I'm guessing you are using a *nix box) to see if you can get it working. From there, I would save this to a shell file in the directory where you want to save your CSV files.

cd /path/to/directory
vi process_csv.sh

And add the following:

#!/bin/bash

cd /path/to/directory
wget http://uri.com/file.txt
diff file.txt file_previous.txt | mail -s "Differences" your@email.com
mv file.txt file_previous.txt

Save and close the file. Make the new shell script executable:

chmod +x process_csv.sh

From there, start investigating the cronjob route. It could be as easy as checking to see if you can edit your crontab file:

crontab -e

With luck, you'll be able to enter your cronjob and save/close the file. It will look something like the following:

01 * * * * /path/to/directory/process_csv.sh

I hope you find this helpful.

继续阅读：csv download file php

Repetitively Retrieve Data from Site via PHP

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？