我正在尝试从上传的 word doc (.doc, .docx, .rtf)
中获取字数,但它总是带有烦人的 Word 格式.
I am trying to grab a word count from an uploaded word doc (.doc, .docx, .rtf)
but it always carries through the annoying Word formatting.
有人之前解决过这个问题并知道如何解决吗?谢谢:)
Anybody tackled this issue before and know how to solve it? Thanks :)
您需要:
区分文件类型
Distinguish the file type
$file_name = $_FILES['image']['name'];
$file_extn = end(explode(".", strtolower($_FILES['image']['name'])));
if($file_extn == "doc" || $file_extn == "docx"){
docx2text();
}elseif($file_extn == "rtf"){
rtf2text();
}
将文档转换为文本
Convert the document to text
https://stackoverflow.com/a/7371315/2512934 用于 doc 或 docxhttp://webcheatsheet.com/php/reading_the_clean_text_from_rtf.php 用于 rtf
https://stackoverflow.com/a/7371315/2512934 for doc or docx http://webcheatsheet.com/php/reading_the_clean_text_from_rtf.php for rtf
数词http://php.net/manual/en/function.str-word-count.php
这篇关于PHP - 从上传的 Microsoft Word 文档中获取字数的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持跟版网!