![]() As you become more proficient, using the command line for some data science tasks is much quicker than writing a Python script or a Hadoop job. In these situations, proficiency with command line data science is a true superpower. Great question! When working in cloud data science environments, you sometimes only have access to a server’s shell. In this post, we’ll learn about how to use the csvkit library to acquire and explore tabular data. In a previous post, we discussed how we used Python and Pandas to clean the dataset. ![]() ![]() The dataset has some data quality issues, however, and requires cleanup. The Museum of Modern Art is one of the most influential museums in the world and they have released a dataset on the artworks in their collection. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |