I'm a newbie to R and have a business problem to solve.
We would like to analyze the near duplicate requests for materials posted by our end users to our procurement department.
This will help us to identify most commonly requested materials and to codify them as a stock item, and possibly identify the suppliers who give good rates.
The material description is a free flow text and different interpretations exists, like, for eg, "3 inch" is written as "3 inch", "3 in", 3" (double quotes), "3 inches", "3 inch", etc.
Your help is much appreciated.