


How to compare two strings for similarity in PHP with similar_text or levenshtein
Jul 11, 2025 am 03:20 AMIn PHP, similar_text() and levenshtein() functions are mainly used to compare string similarity; 1. Similar_text() returns a similar percentage or matched number of characters, suitable for fuzzy search and repeated content detection; 2. levenshtein() returns an editing distance, suitable for automatic error correction and input correction; 3. The selection basis is the requirement: if the percentage is required, select similar_text(), and if the number of steps is required, use levenshtein().
Comparing the similarity between two strings is actually quite common in PHP, such as it is used when doing fuzzy searches, spelling checks, or data cleaning. PHP provides two more practical functions: similar_text()
and levenshtein()
, which can help us determine how close the two strings are.

Let me briefly talk about the difference:
-
similar_text()
returns a percentage of two strings similar (can also return the number of matching characters) -
levenshtein()
returns the edit distance, which is how many operations (add, delete, replace) are required to make the two strings the same
Let’s see how to use it.

How to use similar_text() to determine string similarity
This function is relatively simple and direct, suitable for beginners or scenarios that only require roughly judging the similarity.
Basic usage:

similar_text("apple", "appel", $percent); echo $percent; // Output a value similar to 80 or 80.0
Some points to note:
- The third parameter is to refer to the value, so the first step is to add
&
- The default return is the number of matching characters. If the
$percent
parameter is passed, the percentage will be filled. - The percentage calculation method is
(匹配字符數(shù)* 200) / (字符串1長(zhǎng)度字符串2長(zhǎng)度)
, so it is not an absolute value, but it can be used to compare
Examples of applicable scenarios:
- The user enters "Beijing". If you want to check "Beijing", you can use this to determine whether it is similar.
- Use as an auxiliary means when detecting duplicate content
How to use levenshtein()? What is suitable for?
If you are more concerned with the "gap" between two strings than the "similarity", then levenshtein()
is more appropriate.
Basic examples:
echo levenshtein("hello", "helo"); // Output 1
It returns the minimum number of modifications. In the example above, "hello" and "helo" are only one letter l apart, so the result is 1.
Some details:
- Supports up to 255 characters, if it exceeds it, it will return -1
- Supports custom insertion, replacement, and deletion costs, such as:
levenshtein($str1, $str2, 1, 2, 3); // Insert cost 1, replace cost 2, delete cost 3
Typical application scenarios:
- Automatic error correction, such as if the user typed the wrong word, the system recommends the closest correct spelling
- Preprocessing before database query to correct possible input errors
Which one is better? How to choose?
There is no standard answer to this question, it depends on what the problem you are trying to solve:
- If you want an intuitive "similar percentage", use
similar_text()
first - If you need to know how many steps there is between two strings, use
levenshtein()
For example:
- When looking for similar names, you may first use
levenshtein()
to find the result of the minimum editing distance, and then usesimilar_text()
to see if the similarity meets the standard. - In terms of performance, both are similar, but if the data volume is large, it is recommended to test which scenario is more suitable for you
Basically that's it. Both functions are not complicated, but they are very practical in practical applications. As long as you understand your respective characteristics, it is not difficult to choose.
The above is the detailed content of How to compare two strings for similarity in PHP with similar_text or levenshtein. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undress AI Tool
Undress images for free

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

TosecurelyhandleauthenticationandauthorizationinPHP,followthesesteps:1.Alwayshashpasswordswithpassword_hash()andverifyusingpassword_verify(),usepreparedstatementstopreventSQLinjection,andstoreuserdatain$_SESSIONafterlogin.2.Implementrole-basedaccessc

To safely handle file uploads in PHP, the core is to verify file types, rename files, and restrict permissions. 1. Use finfo_file() to check the real MIME type, and only specific types such as image/jpeg are allowed; 2. Use uniqid() to generate random file names and store them in non-Web root directory; 3. Limit file size through php.ini and HTML forms, and set directory permissions to 0755; 4. Use ClamAV to scan malware to enhance security. These steps effectively prevent security vulnerabilities and ensure that the file upload process is safe and reliable.

In PHP, the main difference between == and == is the strictness of type checking. ==Type conversion will be performed before comparison, for example, 5=="5" returns true, and ===Request that the value and type are the same before true will be returned, for example, 5==="5" returns false. In usage scenarios, === is more secure and should be used first, and == is only used when type conversion is required.

The methods of using basic mathematical operations in PHP are as follows: 1. Addition signs support integers and floating-point numbers, and can also be used for variables. String numbers will be automatically converted but not recommended to dependencies; 2. Subtraction signs use - signs, variables are the same, and type conversion is also applicable; 3. Multiplication signs use * signs, which are suitable for numbers and similar strings; 4. Division uses / signs, which need to avoid dividing by zero, and note that the result may be floating-point numbers; 5. Taking the modulus signs can be used to judge odd and even numbers, and when processing negative numbers, the remainder signs are consistent with the dividend. The key to using these operators correctly is to ensure that the data types are clear and the boundary situation is handled well.

Yes, PHP can interact with NoSQL databases like MongoDB and Redis through specific extensions or libraries. First, use the MongoDBPHP driver (installed through PECL or Composer) to create client instances and operate databases and collections, supporting insertion, query, aggregation and other operations; second, use the Predis library or phpredis extension to connect to Redis, perform key-value settings and acquisitions, and recommend phpredis for high-performance scenarios, while Predis is convenient for rapid deployment; both are suitable for production environments and are well-documented.

TostaycurrentwithPHPdevelopmentsandbestpractices,followkeynewssourceslikePHP.netandPHPWeekly,engagewithcommunitiesonforumsandconferences,keeptoolingupdatedandgraduallyadoptnewfeatures,andreadorcontributetoopensourceprojects.First,followreliablesource

PHPbecamepopularforwebdevelopmentduetoitseaseoflearning,seamlessintegrationwithHTML,widespreadhostingsupport,andalargeecosystemincludingframeworkslikeLaravelandCMSplatformslikeWordPress.Itexcelsinhandlingformsubmissions,managingusersessions,interacti

TosettherighttimezoneinPHP,usedate_default_timezone_set()functionatthestartofyourscriptwithavalididentifiersuchas'America/New_York'.1.Usedate_default_timezone_set()beforeanydate/timefunctions.2.Alternatively,configurethephp.inifilebysettingdate.timez
