Others:
- added basic data analysis to get histograms of text differences
- added new final delivery model
Doc: added descriptions and instructions for the data_preprocess folder