Timeline for How to remove all the duplicated words on every line using Notepad++?
Current License: CC BY-SA 3.0
10 events
when toggle format | what | by | license | comment | |
---|---|---|---|---|---|
Jul 27, 2012 at 6:57 | history | edited | amiregelz | CC BY-SA 3.0 |
Last edit to add more info on how the regex will operate.
|
Jul 27, 2012 at 0:31 | vote | accept | Gabriel | ||
Jul 27, 2012 at 0:16 | comment | added | amiregelz | Another problem is that most of the words that are duplicated also appear in the previous line, which makes it difficult to achieve your goal. If you'd want to remove all duplicated words, then it wasn't that difficult. You could do something like this & this. I suggest you use specific regular expressions in Notepad++ (I can help you, just tell me all the formats of the duplicated words) or consider a different approach to your problem. | |
Jul 26, 2012 at 23:48 | comment | added | amiregelz |
The problem is that the regular expression I wrote in my answer only applies to the format (I thought) you asked for: word, word, anotherword . However, you have many instances which are a little bit different, like came across, came across and some with 3 or 4 words. Also there are words with ' like don't and it makes things more complicated in the Notepad++ regex system. The Notepad++ regex system is pretty annoying and limited as well, so the solution is to either use regex in python (or another language), or make format-specific regular expressions for the Notepad++.
|
|
Jul 26, 2012 at 21:59 | comment | added | Gabriel | I tried every possible combination, no commas, only spaces, no space and comma, and yet nothing. Please enlight me, here's the text file: goo.gl/sP20z | |
Jul 26, 2012 at 21:13 | history | edited | amiregelz | CC BY-SA 3.0 |
Better explanation and more useful info.
|
Jul 26, 2012 at 20:40 | comment | added | amiregelz |
This regex should work: (\b\w+\b)\W+\1 for two duplicated words. I'll edit my answer. The commas are why the other regex doesn't work.
|
|
Jul 26, 2012 at 20:28 | comment | added | Gabriel | Thanks for the help! However I'm getting 0 occurrences, I tried doing this with separated keywords as you suggested and it didn't work, I also tried as they were before and nothing, please check my screen capture: goo.gl/eZ7Kp | |
Jul 26, 2012 at 20:09 | history | edited | amiregelz | CC BY-SA 3.0 |
added 235 characters in body
|
Jul 26, 2012 at 20:03 | history | answered | amiregelz | CC BY-SA 3.0 |