I have a lot of web pages (over 4000) saved from one website on my hard drive. Each saved page consists of one HTML file and one folder where its pictures, CSS, JS, etc. are stored.
Since they were all saved from the same website, which uses the same pictures, CSS, JS, etc. on every page, there are over 4000 duplicates of the same files!
My idea is to merge all those folders, which contain the same things, into one folder "data", and then edit all the picture links in the HTML files to point to that "data" folder. This way there won't be any duplicates!
For instance, let's say the name of the HTML file is "How to animate a 3d model.html". Its folder would be "How to animate a 3d model_files", and the code that refers to this folder in the HTML would be src="How%20to%20animate%20a%203d%20model_files/picture.jpg".
So I was thinking of doing this job with a loop that would replace all the paths in the HTML files and merge all the "_files" folders into one, but this seems to be beyond my batch knowledge :(
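For reference, the loop described above could look roughly like this in Python (not batch). It is only a sketch under assumptions: the function name merge_saved_pages and the folder name "data" are made up for illustration, every page is assumed to sit directly in one root folder next to its "<page name>_files" folder, and links are assumed to contain the folder name either URL-encoded (as in the example above) or as plain text. The rewrite is a plain substring replace, so it is worth trying on a copy of the pages first.

```python
import filecmp
import shutil
from pathlib import Path
from urllib.parse import quote


def merge_saved_pages(root, data_name="data"):
    """Merge every '<page>_files' folder under root into root/data_name
    and point each page's links at the shared folder.

    Returns the names of pages that were skipped because one of their
    files clashes (same name, different content) with a file already
    in the shared folder. The name and layout are assumptions.
    """
    root = Path(root)
    data = root / data_name
    data.mkdir(exist_ok=True)
    skipped = []

    for page in sorted(root.glob("*.htm*")):
        folder = root / (page.stem + "_files")
        if not folder.is_dir():
            continue

        # Pair every file in the page's folder with its shared location.
        pairs = [(f, data / f.relative_to(folder))
                 for f in folder.rglob("*") if f.is_file()]

        # Same name but different bytes: leave this page untouched.
        if any(t.exists() and not filecmp.cmp(f, t, shallow=False)
               for f, t in pairs):
            skipped.append(page.name)
            continue

        for f, target in pairs:
            if target.exists():
                continue  # identical duplicate, deleted with the folder
            target.parent.mkdir(parents=True, exist_ok=True)
            shutil.move(str(f), str(target))

        # Rewrite both the URL-encoded and the raw folder prefix.
        html = page.read_bytes()
        new_prefix = (data_name + "/").encode("ascii")
        for old in (quote(folder.name), folder.name):
            html = html.replace((old + "/").encode("utf-8"), new_prefix)
        page.write_bytes(html)

        shutil.rmtree(folder)

    return skipped
```

Calling merge_saved_pages(r"C:\path\to\saved pages") (a placeholder path) would process every page in that folder and return the list of pages it left alone. Pages are skipped rather than overwritten on a name clash because two different pictures could both be called "picture.jpg".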
Can anyone help me? Or at least give me a link that would help?