pandas read a text file and split the names into columns based on the first character

krock1516

Hi, I'm looking to see if we can read a text file with pandas and place its lines into separate columns based on each line's first character.

Below is the text file

$ cat file.txt
AAAAAA
AAAAAA
AAAAAA
AAAAAA
AAAAAA
BBBBBB
BBBBBB
BBBBBB
BBBBBB
BBBBBB
CCCCCC
CCCCCC
CCCCCC
CCCCCC
CCCCCC
DDDDDD
DDDDDD
DDDDDD
DDDDDD
DDDDDD
EEEEEE
EEEEEE
EEEEEE
EEEEEE
EEEEEE
FFFFFF
FFFFFF
FFFFFF
FFFFFF
FFFFFF

Desired:

COL_1   COL_2   COL_3   COL_4   COL_5   COL_6
AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF

Quang Hoang

Probably not the best way:

import pandas as pd

# notice the header=None option
df = pd.read_csv('file.txt', header=None)

# extract the first character of the string
df['start'] = df[0].str[0]

# group by the first character of the string
# cumcount gives you the order/rank of the row within its group
df['idx'] = df.groupby('start').cumcount()

# pivot - search StackOverflow for 47152691
df.pivot(index='idx', columns='start', values=0)

Output:

start       A       B       C       D       E       F
idx                                                  
0      AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
1      AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
2      AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
3      AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
4      AAAAAA  BBBBBB  CCCCCC  DDDDDD  EEEEEE  FFFFFF
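
The output above is keyed by the first character (A to F) and keeps the idx index, while the question asks for COL_1 to COL_6 headers. A minimal sketch of one way to get that layout, assuming the same cumcount/pivot approach; the in-memory sample text and the names `text`, `value`, and `wide` are illustrative stand-ins, not part of the original answer:

```python
import io
import pandas as pd

# Stand-in for file.txt so the sketch runs on its own:
# five lines each of AAAAAA, BBBBBB, ..., FFFFFF.
text = "\n".join(letter * 6 for letter in "ABCDEF" for _ in range(5))

df = pd.read_csv(io.StringIO(text), header=None, names=["value"])

# Key each row on its first character and number the rows within each key.
df["start"] = df["value"].str[0]
df["idx"] = df.groupby("start").cumcount()

# One column per first character, one row per position within the group.
wide = df.pivot(index="idx", columns="start", values="value")

# Relabel to the requested layout: COL_1..COL_n and a plain 0..n-1 index.
wide.columns = [f"COL_{i}" for i in range(1, len(wide.columns) + 1)]
wide = wide.reset_index(drop=True)

print(wide)
```

Note that pivot sorts the column keys, so COL_1 to COL_n follow the sorted first characters rather than the order in which they appear in the file, and groups with fewer lines than the longest one are padded with NaN.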

Edited on 2020-12-7