[Boards: 3 / a / aco / adv / an / asp / b / biz / c / cgl / ck / cm / co / d / diy / e / fa / fit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mu / n / news / o / out / p / po / pol / qa / r / r9k / s / s4s / sci / soc / sp / t / tg / toy / trash / trv / tv / u / v / vg / vp / vr / w / wg / wsg / wsr / x / y ] [Home]
4chanarchives logo
Deleting string of characters in Notepad ++
Images are sometimes not shown due to bandwidth/network limitations. Refreshing the page usually helps.

You are currently reading a thread in /g/ - Technology

Thread replies: 24
Thread images: 5
File: BTaKGgn9c.jpg (27 KB, 323x301) Image search: [Google]
BTaKGgn9c.jpg
27 KB, 323x301
Does anyone have any idea how I can delete the numbers in this text (the string on numbers that come after the article titles). I'm using Notepad++ and have about 50 thousand lines in the file.


2015–16 Missouri State Lady Bears basketball team 54752527117 20791 32394002801
Gesellschafts- und Wirtschaftsmuseum 54752559838 2708 32394005509
Timothy Hodge (swimmer) 54752564340 2909 32394008418
Ediacara Conservation Park 54752575029 5819 32394014237
Candice Lin 54752585298 2762 32394016999
Sarah Nagourney 54752591883 11158 32394028157
Alabama Wing Civil Air Patrol 54752603582 3995 32394032152
Zhu Qingyi 54752614931 2743 32394034895
The Suicide Theory 54752621562 9411 32394044306
2015–16 Southern Illinois Salukis women's basketball team 54752632145 20451 32394064757
History of swimwear 54752653835 39735 32394104492
Hum Sab Ustad Hain 54752699584 2541 32394107033
Gerald R. Johnson 54752708541 4775 32394111808
Kirsten Moana Thompson 54752716436 10061 32394121869
Astoria (Marianas Trench album) 54752730479 15080 32394136949
Franz Rauscher 54752775706 1075 32394138024
GCU Soccer Stadium 54752783307 4976 32394143000
Las secretas intenciones 54752794678 4306 32394147306
Arthur and Merlin (film) 54752810186 3485 32394150791
Salome Þorkelsdóttir 54752816152 4183 32394154974
Winifred Kiek 54752820897 6366 32394161340
The Dead House 54752827802 10441 32394171781
Richa Maheshwari 54752838976 1920 32394173701
Central Square, Chennai 54752844294 1497 32394175198
Vienna Party School 54752854391 1248 32394176446
Chair of Saint Peter (disambiguation) 54752858618 301 32394176747
Felix Russo 54752860052 5849 32394182596
Mitratech Holdings Inc. 54752880115 9214 32394191810
Armenia national football team results (2010s) 54752898364 19459 32394211269
>>
yes, I know
>>
Search and replace. Apparently notepad++ needs an addon for regex searching/replacing, so your best bet is to make a small script in your favorite scripting language.
>>
>>51624958
me too
>>
>>51624958
>>51624970
yeah lads I agree it was pretty easy
>>
>>51624893

Save to file.
Use awk (or sed, if you're feeling funky).
>>
If it helps, there are tabs between the last number strings, so it goes like;

2015–16 Missouri State Lady Bears basketball team [tab] 54752527117 [tab] 20791 [tab] 32394002801

Anyone know a script I can use in Notepad ++ or any other program
>>
>>51625108
>or any other program

Ok I give in. What other programs do you have available? got perl?
>>
>>51625105

lrn2unix bro

sed -i.backup 's:[[:digit:]]*$::' <FILE>
>>
>>51625145

I'm using notepad++, notepad2, and emeditor

I'm trying to dump the entire english wikipedia to an offline device called wikireader, and the last 50 thousand articles won't render (convert to wikireader format). I need to create a list of those articles to use Special:export on wikipedia to manually download them
>>
>>51625179

Shit, it's got spaces.

's:[[:digit:] ]*$::'
>>
Can someone do the job for me....pretty please.

filedropper com / list_1
>>
>>51624893
Copy shit to https://regex101.com/

/w
>>
File: 1ksLN1s.jpg (3 MB, 4013x2866) Image search: [Google]
1ksLN1s.jpg
3 MB, 4013x2866
>>51625239
NO STEVE THIS IS ELEMENTARY STUFF JUST READ UP ON REGEX FOR 10 MINUTES
>>
>>51625240

Ok, I copied it, now what
>>
File: ss+(2015-12-01+at+06.00.46).png (65 KB, 1429x650) Image search: [Google]
ss+(2015-12-01+at+06.00.46).png
65 KB, 1429x650
>>51625269
>>
File: 771247.jpg (19 KB, 600x346) Image search: [Google]
771247.jpg
19 KB, 600x346
>>51625262

Life sucks
>>
http://www.kiwix.org/wiki/Main_Page
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Wikipedia dump, compressed and with viewer.

https://www.cygwin.com/
^^^^^^^^^^^^^^^^^^^^^^^^
GNU shell and utilities for Windows.

http://tldp.org/HOWTO/Bash-Prog-Intro-HOWTO.html

http://www.tldp.org/LDP/Bash-Beginners-Guide/html/Bash-Beginners-Guide.html

http://www.tldp.org/LDP/abs/html/
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Guides to the above

You should be able to use cygwin to run >>51625179
>>
>>51625283

I only need to get rid of the last three sets of numbers on each line. Otherwise Special:Export won't work
>>
just import into any spreadsheet program using tab delimiting, copy-paste first column.
>>
something like. ^(. +)(\D+)(.+)$

replace with \1\2

learn regex fagget
>>
>>51625239
>filedropper com / list_1
i noticed there was a bunch of space at the start of every number sequence..so i just used simple regex to delete everything after that space, lmao

[    ].*


ghetto as fuck but see if its clean enough yourself

http://a.pomf.cat/ckammb.txt
>>
File: easy as fuck.png (129 KB, 1000x1057) Image search: [Google]
easy as fuck.png
129 KB, 1000x1057
wow i'm rusty the answer was simple as fuck, but i was trying to do in a way more complicated than needed
>>
>>51624893
Use a real text editor.
Thread replies: 24
Thread images: 5

banner
banner
[Boards: 3 / a / aco / adv / an / asp / b / biz / c / cgl / ck / cm / co / d / diy / e / fa / fit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mu / n / news / o / out / p / po / pol / qa / r / r9k / s / s4s / sci / soc / sp / t / tg / toy / trash / trv / tv / u / v / vg / vp / vr / w / wg / wsg / wsr / x / y] [Home]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
If a post contains personal/copyrighted/illegal content you can contact me at [email protected] with that post and thread number and it will be removed as soon as possible.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com, send takedown notices to them.
This is a 4chan archive - all of the content originated from them. If you need IP information for a Poster - you need to contact them. This website shows only archived content.