MySQL: Count occurrences of words in a column

What?
This is an article to document how to output the most frequently used words in a MySQL database column.

How?
Note this only applies to MySQL. So the following query will return a count of each word in your database column by order of most used first (replace 'mydbprefix_content' with the name of your database table AND `introtext` with the name of your column):
SELECT SUM(`total_count`) as `total`, `value`
FROM (
    SELECT count(*) AS `total_count`, REPLACE(REPLACE(REPLACE(x.`value`,'?',''),'.',''),'!','') as `value`
    FROM (
        SELECT SUBSTRING_INDEX(SUBSTRING_INDEX(ExtractValue(t.`introtext`, '//text()'), ' ', n.n), ' ', -1) `value`
        FROM `mydbprefix_content` t CROSS JOIN 
        (
            SELECT a.N + b.N * 10 + 1 n
                FROM 
                (SELECT 0 AS N UNION ALL SELECT 1 UNION ALL SELECT 2 UNION ALL SELECT 3 UNION ALL SELECT 4 UNION ALL SELECT 5 UNION ALL SELECT 6 UNION ALL SELECT 7 UNION ALL SELECT 8 UNION ALL SELECT 9) a
                ,(SELECT 0 AS N UNION ALL SELECT 1 UNION ALL SELECT 2 UNION ALL SELECT 3 UNION ALL SELECT 4 UNION ALL SELECT 5 UNION ALL SELECT 6 UNION ALL SELECT 7 UNION ALL SELECT 8 UNION ALL SELECT 9) b
                ORDER BY n
        ) n
        WHERE n.n <= 1 + (LENGTH(ExtractValue(t.`introtext`, '//text()')) - LENGTH(REPLACE(ExtractValue(t.`introtext`, '//text()'), ' ', '')))
        ORDER BY `value`
    ) AS x
    GROUP BY x.`value`
) AS y
WHERE LENGTH(`value`)>5
AND TRIM(`value`) NOT IN ('Article','This',' ','content','component','applies','because','doesn''t')
GROUP BY `value`
ORDER BY `total` DESC
LIMIT 0,50;
Additional Note(s):
  • I have added a WHERE clause which omits words less than 5 characters in length, and some other words not to count.
  • I have added the ORDER BY to give me the highest count first in descending order.
  • I have added a LIMIT to only return the top 50 words.

Source(s):

Credit where Credit is Due:


Feel free to copy, redistribute and share this information. All that we ask is that you attribute credit and possibly even a link back to this website as it really helps in our search engine rankings.

Disclaimer: Please note that the information provided on this website is intended for informational purposes only and does not represent a warranty. The opinions expressed are those of the author only. We recommend testing any solutions in a development environment before implementing them in production. The articles are based on our good faith efforts and were current at the time of writing, reflecting our practical experience in a commercial setting.

Thank you for visiting and, as always, we hope this website was of some use to you!

Kind Regards,

Joel Lipman
www.joellipman.com

Related Articles

Joes Revolver Map

Accreditation

Badge - Certified Zoho Creator Associate
Badge - Certified Zoho Creator Associate

Donate & Support

If you like my content, and would like to support this sharing site, feel free to donate using a method below:

Paypal:
Donate to Joel Lipman via PayPal

Bitcoin:
Donate to Joel Lipman with Bitcoin bc1qf6elrdxc968h0k673l2djc9wrpazhqtxw8qqp4

Ethereum:
Donate to Joel Lipman with Ethereum 0xb038962F3809b425D661EF5D22294Cf45E02FebF
© 2024 Joel Lipman .com. All Rights Reserved.