开发者

How do I remove the URL protocol and slash from user input in PHP

Example user input

http://example.com/
http://example.com/topic/
http://example.com/topic/cars/
http://www.example.com/topic/questions/

I want a PHP f开发者_如何学Gounction to make the output like

example.com
example.com/topic/
example.com/topic/cars/
www.example.com/topic/questions/


ereg_replace is now deprecated, so it is better to use:

$url = preg_replace("(^https?://)", "", $url );

This removes either http:// or https://


You should use an array of "disallowed" terms and use strpos and str_replace to dynamically remove them from the passed-in URL:

function remove_http($url) {
   $disallowed = array('http://', 'https://');
   foreach($disallowed as $d) {
      if(strpos($url, $d) === 0) {
         return str_replace($d, '', $url);
      }
   }
   return $url;
}


I'd suggest using the tools PHP gave you, have a look at parse_url.

<?php
$url = 'http://username:password@hostname/path?arg=value#anchor';

print_r(parse_url($url));

echo parse_url($url, PHP_URL_PATH);
?>

The above example will output:

Array
(
    [scheme] => http
    [host] => hostname
    [user] => username
    [pass] => password
    [path] => /path
    [query] => arg=value
    [fragment] => anchor
)
/path

It sounds like you're after at least host + path (add others as needed, e.g. query):

$parsed = parse_url('http://www.domain.com/topic/questions/');

echo $parsed['host'], $parsed['path'];

    > www.domain.com/topic/questions/

Cheers


Create an array:

$remove = array("http://","https://");

and replace with empty string:

str_replace($remove,"",$url);

it would look something like this:

function removeProtocol($url){
    $remove = array("http://","https://");
    return str_replace($remove,"",$url);
}

Str_replace will return a string if your haystack (input) is a string and you replace your needle(s) in the array with a string. It's nice so you can avoid all the extra looping.


Wow. I came here from google expecting to find a one liner to copy and paste!

You don't need a function to do this because one already exists. Just do:

echo explode("//", "https://anyurl.any.tld/any/directory/structure", 2)[1];

In this example, explode() will return an array of:

["https:", "anyurl.any.tld/any/directory/structure"]

And we want the 2nd element. This will handle http, https, ftp, or pretty much any URI, without needing regex.

https://www.php.net/manual/en/function.explode.php

If you want a function:

function removeProtocols($uri) { return explode("//", $uri, 2)[1]; }

EDIT: See user comment from Harry Lewis... this is my favourite way to do this now.


You can remove both https and http in one line using a regular expression with preg_replace:

fn (string $url): string
    => preg_replace('~^https?://~', '', $url);
  • Regular expression: ~^https?://~ Test and Explanation (regex101.com)


You could use the parse url Functionality of PHP. This will work for all Protocols, even ftp:// or https://

Eiter get the Protocol Component and substr it from the Url, or just concatenate the other Parts back together ...

http://php.net/manual/de/function.parse-url.php


<?php
// user input
$url = 'http://www.example.com/category/website/wordpress/wordpress-security/';
$url0 = 'http://www.example.com/';
$url1 = 'http://www.example.com/category/';
$url2 = 'http://www.example.com/category/website/';
$url3 = 'http://www.example.com/category/website/wordpress/';

// print_r(parse_url($url));
// echo parse_url($url, PHP_URL_PATH);

$removeprotocols = array('http://', 'https://');

echo '<br>' . str_replace($removeprotocols,"",$url0);
echo '<br>' . str_replace($removeprotocols,"",$url1);
echo '<br>' . str_replace($removeprotocols,"",$url2);
echo '<br>' . str_replace($removeprotocols,"",$url3);

?>
0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜