php urlencode()與urldecode()函數(shù)字符編碼原理詳解
Jun 13, 2016 pm 12:03 PM
其原理就是把中文字符轉(zhuǎn)換為十六進(jìn)制并按某種規(guī)則進(jìn)行字符串組合,實(shí)現(xiàn)字符的編碼與解編碼,保證URL數(shù)據(jù)傳遞過程中字符的完整性和兼容性,主要討論中文字符的編碼情況。
一,F(xiàn)ireFox瀏覽器編碼中文字符
在Firefox瀏覽器下如果輸入中文字符,將會自動實(shí)現(xiàn)URL編碼,如下
按下Enter鍵前
按下Enter鍵后
二,urlencode()函數(shù)原理
urlencode()函數(shù)用于編碼URL字符串,這里主要討論中文字符的編碼情況,
實(shí)例如下
復(fù)制代碼 代碼如下:
echo urlencode('不要迷戀哥');//輸出:%B2%BB%D2%AA%C3%D4%C1%B5%B8%E7
urlencode()函數(shù)原理就是首先把中文字符轉(zhuǎn)換為十六進(jìn)制,然后在每個字符前面加一個標(biāo)識符%,了解了這個原理,可以實(shí)現(xiàn)自定義的URL編碼函數(shù),代碼如下
復(fù)制代碼 代碼如下:
$string = "不要迷戀哥";
$length = strlen($string);
echo $string;
$result = array();
//十進(jìn)制
for($i=0;$iif(ord($string[$i])>127){
$result[] = ord($string[$i]).' '.ord($string[++$i]);
}
}
var_dump($result);
//十六進(jìn)制
$strings = array();
foreach($result as $v){
$dec = explode(" ",$v);
$strings[] = "%".dechex($dec[0])." "."%".dechex($dec[1]);
}
var_dump($strings);
上面代碼在[PHP實(shí)現(xiàn)中文字符進(jìn)制轉(zhuǎn)換原理分析]一文中中文字符轉(zhuǎn)十六進(jìn)制原理分析部分有詳細(xì)討論,通過獲取漢字的各個字符再轉(zhuǎn)換為十六進(jìn)制,同時在每個字符前面加上一個特殊的標(biāo)識符%,就實(shí)現(xiàn)了urlencode()函數(shù)的功能,輸出結(jié)果如下
然后對輸出的結(jié)果與直接使用urlencode()編碼的字符進(jìn)行比較,如上:%B2%BB%D2%AA%C3%D4%C1%B5%B8%E7
通過上面實(shí)例可知,使用urlencode()函數(shù)編碼中文字符實(shí)質(zhì)上就是把字符轉(zhuǎn)換為十六進(jìn)制再在第個字符左邊加上一個特殊的標(biāo)識符%
三,urldecode()函數(shù)原理
使用urldecode()函數(shù)解碼已編碼的 URL 字符串,實(shí)例如下
echo urldecode('%B2%BB%D2%AA%C3%D4%C1%B5%B8%E7');//輸出:不要迷戀哥
urldecode()函數(shù)與urlencode()函數(shù)原理相反,用于解碼已編碼的 URL 字符串,其原理就是把十六進(jìn)制字符串轉(zhuǎn)換為中文字符,結(jié)合上面實(shí)例,同樣可實(shí)現(xiàn)自定義函數(shù)解碼字符串
復(fù)制代碼 代碼如下:
$string = '%B2%BB%D2%AA%C3%D4%C1%B5%B8%E7';
$length = strlen($string);
$hexs = array();
for($i=0;$iif($string[$i] == '%'){
$hexs[] = $string[++$i].$string[++$i];
}
}
$num = count($hexs);
for($i=0;$iecho chr(hexdec($hexs[$i])).chr(hexdec($hexs[++$i]));
}
上面實(shí)例代碼首先按字符串的規(guī)則取出各個字符的十六進(jìn)制,然后使用hexdec()函數(shù)把十六進(jìn)制轉(zhuǎn)換為十進(jìn)制,然后再使用chr()函數(shù)把十進(jìn)制轉(zhuǎn)換為字符,實(shí)現(xiàn)十六進(jìn)制轉(zhuǎn)換為字符。輸出結(jié)果如下
四,urldecode()與urlencode()函數(shù)說明
urlencode
(PHP 3, PHP 4, PHP 5)
urlencode -- 編碼 URL 字符串
說明
string urlencode ( string str )
返回字符串,此字符串中除了 -_. 之外的所有非字母數(shù)字字符都將被替換成百分號(%)后跟兩位十六進(jìn)制數(shù),空格則編碼為加號(+)。此編碼與 WWW 表單 POST 數(shù)據(jù)的編碼方式是一樣的,同時與 application/x-www-form-urlencoded 的媒體類型編碼方式一樣。由于歷史原因,此編碼在將空格編碼為加號(+)方面與 RFC1738 編碼(參見 rawurlencode())不同。此函數(shù)便于將字符串編碼并將其用于 URL 的請求部分,同時它還便于將變量傳遞給下一頁
urldecode
(PHP 3, PHP 4, PHP 5)
urldecode -- 解碼已編碼的 URL 字符串
說明
string urldecode ( string str )
解碼給出的已編碼字符串中的任何 %##。返回解碼后的字符串。
五,參考資源
urlencode()說明
urldecode()說明

Hot AI Tools

Undress AI Tool
Undress images for free

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

In PHP, you can use square brackets or curly braces to obtain string specific index characters, but square brackets are recommended; the index starts from 0, and the access outside the range returns a null value and cannot be assigned a value; mb_substr is required to handle multi-byte characters. For example: $str="hello";echo$str[0]; output h; and Chinese characters such as mb_substr($str,1,1) need to obtain the correct result; in actual applications, the length of the string should be checked before looping, dynamic strings need to be verified for validity, and multilingual projects recommend using multi-byte security functions uniformly.

AgeneratorinPHPisamemory-efficientwaytoiterateoverlargedatasetsbyyieldingvaluesoneatatimeinsteadofreturningthemallatonce.1.Generatorsusetheyieldkeywordtoproducevaluesondemand,reducingmemoryusage.2.Theyareusefulforhandlingbigloops,readinglargefiles,or

To prevent session hijacking in PHP, the following measures need to be taken: 1. Use HTTPS to encrypt the transmission and set session.cookie_secure=1 in php.ini; 2. Set the security cookie attributes, including httponly, secure and samesite; 3. Call session_regenerate_id(true) when the user logs in or permissions change to change to change the SessionID; 4. Limit the Session life cycle, reasonably configure gc_maxlifetime and record the user's activity time; 5. Prohibit exposing the SessionID to the URL, and set session.use_only

The urlencode() function is used to encode strings into URL-safe formats, where non-alphanumeric characters (except -, _, and .) are replaced with a percent sign followed by a two-digit hexadecimal number. For example, spaces are converted to signs, exclamation marks are converted to!, and Chinese characters are converted to their UTF-8 encoding form. When using, only the parameter values ??should be encoded, not the entire URL, to avoid damaging the URL structure. For other parts of the URL, such as path segments, the rawurlencode() function should be used, which converts the space to . When processing array parameters, you can use http_build_query() to automatically encode, or manually call urlencode() on each value to ensure safe transfer of data. just

You can use substr() or mb_substr() to get the first N characters in PHP. The specific steps are as follows: 1. Use substr($string,0,N) to intercept the first N characters, which is suitable for ASCII characters and is simple and efficient; 2. When processing multi-byte characters (such as Chinese), mb_substr($string,0,N,'UTF-8'), and ensure that mbstring extension is enabled; 3. If the string contains HTML or whitespace characters, you should first use strip_tags() to remove the tags and trim() to clean the spaces, and then intercept them to ensure the results are clean.

There are two main ways to get the last N characters of a string in PHP: 1. Use the substr() function to intercept through the negative starting position, which is suitable for single-byte characters; 2. Use the mb_substr() function to support multilingual and UTF-8 encoding to avoid truncating non-English characters; 3. Optionally determine whether the string length is sufficient to handle boundary situations; 4. It is not recommended to use strrev() substr() combination method because it is not safe and inefficient for multi-byte characters.

To set and get session variables in PHP, you must first always call session_start() at the top of the script to start the session. 1. When setting session variables, use $_SESSION hyperglobal array to assign values ??to specific keys, such as $_SESSION['username']='john_doe'; it can store strings, numbers, arrays and even objects, but avoid storing too much data to avoid affecting performance. 2. When obtaining session variables, you need to call session_start() first, and then access the $_SESSION array through the key, such as echo$_SESSION['username']; it is recommended to use isset() to check whether the variable exists to avoid errors

Key methods to prevent SQL injection in PHP include: 1. Use preprocessing statements (such as PDO or MySQLi) to separate SQL code and data; 2. Turn off simulated preprocessing mode to ensure true preprocessing; 3. Filter and verify user input, such as using is_numeric() and filter_var(); 4. Avoid directly splicing SQL strings and use parameter binding instead; 5. Turn off error display in the production environment and record error logs. These measures comprehensively prevent the risk of SQL injection from mechanisms and details.
