Author |
Message |
floppydrivez
Involved
data:image/s3,"s3://crabby-images/19054/19054ce8282204d9ea618292eea9886d76874318" alt="Involved Involved"
data:image/s3,"s3://crabby-images/ef758/ef7587a5fda6c8c40ad52d95869189c364c265b0" alt=""
Joined: Feb 26, 2006
Posts: 340
Location: Jackson, Mississippi
|
Posted:
Wed Jul 25, 2007 8:02 am |
|
Maybe not the correct forum, but anyway I am trying to check the content of a page using file_get_contents($url). Sentinel keeps blocking me. Why is this? |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
Gremmie
Former Moderator in Good Standing
data:image/s3,"s3://crabby-images/e0184/e0184c289d846a553594e6ddcdc67f3354a52fed" alt=""
Joined: Apr 06, 2006
Posts: 2415
Location: Iowa, USA
|
Posted:
Wed Jul 25, 2007 8:58 am |
|
Could you be more specific about what you are doing? Sentinel examines $_GET and $_POST and doesn't have anything to do with file_get_contents() per se. But if you were passing the $url to a script that might do it.... |
_________________ Only registered users can see links on this board! Get registered or login! - An Event Calendar for PHP-Nuke
Only registered users can see links on this board! Get registered or login! - A Google Maps Nuke Module |
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
floppydrivez
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 9:01 am |
|
Simply trying to check if a link exist on another page.
Code:function checklink($url, $mydomain){
$url = strtolower($url);
if (strstr(file_get_contents($url), $mydomain)) {
$link = "linked";
} else {
$link = "not linked";
}
return $link;
}
|
|
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
Gremmie
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 9:58 am |
|
How is checklink() being invoked? Are you calling it after passing $url or $mydomain in $_GET or $_POST?
How do you know it is file_get_contents()?
What does Sentinel say on the ban screen for the reason? What are the $_GET & $_POST values on the ban screen? |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
floppydrivez
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 10:14 am |
|
No, I am not using $_GET or $_POST which leaves file_get_contents
Code:checklink($usersite, $nukeurl);
|
I know its file_get_contents because I stripped out everything from the function echo'd the output. BTW $usersite is just user_website from the database.
This is what I get when I did so nothing special
Quote: | You have attempted to access this site with an invalid User Agent.
If you think this is a mistake you can contact the site webmaster at admin@blah.com.
Be SURE to include the following information in any email!
User Agent: none
Remote Address: 72.249.25.225
Client IP: none
Forwarded For: none |
Just curious if there are some extra steps I should take to allow that site to access the other. |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
evaders99
Former Moderator in Good Standing
data:image/s3,"s3://crabby-images/c915b/c915ba1715f1389dcc5b042d6c45c550b39402b4" alt=""
Joined: Apr 30, 2004
Posts: 3221
|
Posted:
Wed Jul 25, 2007 10:21 am |
|
"invalid User Agent" seems to be the key here. You may need to use the cURL functions to do a proper HTTP retreival and spoof the User Agent |
_________________ - Only registered users can see links on this board! Get registered or login! -
Need help? Only registered users can see links on this board! Get registered or login! |
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
floppydrivez
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 10:21 am |
|
SGremmie, I guess my real question is how can bypass that invalid user agent bit. |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
Gremmie
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 10:26 am |
|
Oh I get it now....lol....sorry...
As evaders suggests, you might have to form up a more proper HTTP request.
Or if the site you are querying is under your control, you can modify Sentinel to remove that invalid user agent check. There was a recent thread about how to do this. I hope Bob adds a flag for that like the invalid IP check flag in a future version. |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
floppydrivez
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 11:22 am |
|
Anyone a curl pro? I see lots of options to use in my function but I honestly have no clue what they mean. I am simply guessing and guessing.
Code:function checklink($url, $mydomain){
$ch = curl_init();
curl_setopt ($ch, CURLOPT_URL, $url);
curl_setopt ($ch, CURLOPT_HEADER, true);
curl_setopt ($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt ($ch, CURLOPT_FRESH_CONNECT, true);
$contents = curl_exec ($ch);
curl_close ($ch);
$arrText = array();
$arrText = explode("\n", $contents);
if (strpos ($contents, $mydomain) != FALSE) {
return true;
} else {
return false;
}
}
|
|
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
floppydrivez
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 11:27 am |
|
I can see now I got some learning to do. |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
floppydrivez
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 11:50 am |
|
Chalk one up to the manual for some help.
This needs some refining. I still don't understand some of it. Anyway works like a champ.
Code:function checklink($url, $mydomain){
$agent = "CTLinks";
$ch = curl_init();
curl_setopt ($ch, CURLOPT_URL, $url);
curl_setopt ($ch, CURLOPT_HEADER, 0);
curl_setopt ($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt ($ch, CURLOPT_FRESH_CONNECT, 1);
curl_setopt($ch, CURLOPT_USERAGENT, $agent);
$contents = curl_exec ($ch);
curl_close ($ch);
$arrText = array();
$arrText = explode("\n", $contents);
if (strpos ($contents, $mydomain) != FALSE) {
return true;
} else {
return false;
}
}
|
Don't forget to set the agent string to something relevant. Like I said, it works but I haven't put it through a tough beta yet. |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
montego
Site Admin
data:image/s3,"s3://crabby-images/90769/907690f0b3800b7c3631940ce09741fc8d7ec9ba" alt=""
Joined: Aug 29, 2004
Posts: 9457
Location: Arizona
|
Posted:
Wed Jul 25, 2007 7:08 pm |
|
floppydrivez wrote: | I can see now I got some learning to do. |
Me too.
Thanks Evaders! |
_________________ Only registered users can see links on this board! Get registered or login!
Only registered users can see links on this board! Get registered or login! |
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
floppydrivez
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 7:13 pm |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
evaders99
data:image/s3,"s3://crabby-images/6ea31/6ea3138e9a23822aea960115951a6c1ae34639ea" alt=""
|
Posted:
Wed Jul 25, 2007 8:16 pm |
|
cURL is pretty fun stuff. Sadly it allows the hackers to do the same as you, HTTP requests that look pretty legitimate... fortunately I've not seen massive abuse of it yet. |
|
|
|
data:image/s3,"s3://crabby-images/74676/7467655c43f84619d5d7cf725b1d668453dba0fe" alt="" |
|