A Python script for loading CSV data into list and dictionary variables

2025-07-03 13:24:45

The script is shown below:

#coding=utf8
import csv
import logging
logging.basicConfig(level=logging.DEBUG,
        format='%(asctime)s %(filename)s[line:%(lineno)d] %(levelname)s %(message)s',
        datefmt='%a, %d %b %Y %H:%M:%S',
        filename='readDate.log',
        filemode='w')
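'''
This module reads an existing csv file and, through the
readDataToList function, writes the relevant columns into
dictionaries; each dictionary is treated as one JSON record.
'''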
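class GenExceptData(object):
  def __init__(self):
    #Rows read from the csv file
    self.mdbuffer=[]
    #Initialised up front so the finally block is safe if open() fails
    csvHand=None
    try:
      #Open the csv file for reading (binary mode, as the Python 2 csv module expects)
      csvHand=open("20170510174450.csv","rb")
      #Create the csv reader
      readcsv=csv.reader(csvHand)
      #Copy every row of the csv file into mdbuffer
      for row in readcsv:
          self.mdbuffer.append(row)
      #Convert the data to dictionaries
      #self.readDataToList()
    except Exception,e:
      logging.error("Read CSV error: %s",e)
    finally:
      #Close the csv file if it was opened
      if csvHand is not None:
        csvHand.close()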

  def readDataToList(self):
    try:
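      #Number of rows in mdbuffer
      rowNumber=len(self.mdbuffer)
      #Index of the row where the current event starts
      currentrow=1
      #Dictionary holding the properties of one JSON record
      propertyJson={}
      #propertyJsonList=[]
      #count=0
      #List collecting the finished records
      dataList=[]
      try:
        for row in range(1,rowNumber):
          #Temporary dictionary for the property key/value pair of this iteration
          temp={}

          #Row being processed in this iteration
          item=self.mdbuffer[row]
          #Current element: the row at which the
          #current event starts
          currentItem=self.mdbuffer[currentrow]
          #Decode the serviceId
          serviceId= currentItem[2].decode("gbk")
          #Decode the property name into propertyName
          propertyName=item[3].decode("gbk")
          #Decode the property value into propertyValue
          propertyValue=item[4].decode("gbk")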
          try:
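            #Check whether this row has the same tracking event and serviceId as the current element
            if item[0]==currentItem[0] and item[2]==currentItem[2]:
              #Store the serviceId in propertyJson
              propertyJson["serviceId"]=serviceId
              #Put the property name/value pair into temp
              temp[propertyName]=propertyValue
              #Use the dictionary's update method to merge the pair
              #from temp into propertyJson
              propertyJson.update(temp)
              #The row belongs to the current event, so move on to the next row
              continue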
            else:
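              #Make this row the start of the next event
              currentrow=row
              #Decode the property name of the finished event's first row into propertyName
              propertyName=currentItem[3].decode("gbk")
              #Decode the property value of the finished event's first row into propertyValue
              propertyValue=currentItem[4].decode("gbk")
              #Store the serviceId in propertyJson
              propertyJson["serviceId"]=serviceId
              #Store the property name/value pair in propertyJson
              propertyJson[propertyName]=propertyValue
              #propertyJsonList.append(propertyJson)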
              dataList.append(propertyJson)
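              '''
              A note on propertyJson.clear() versus propertyJson={}:
              propertyJson.clear() empties the existing dictionary in place without
              creating a new object, so every reference to it sees the change;
              propertyJson={} binds the name to a new dictionary and leaves the
              old one untouched.
              For dataList.append(propertyJson) to keep the values of each round,
              propertyJson={} must be used.
              With propertyJson.clear(), every entry in dataList would refer to the
              same dictionary and the values stored earlier would be lost.
              '''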
              propertyJson={}
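          except Exception,e:
            logging.error("Get Property Json Error: %s",e)
            print "Get Property Json Error:",e
        #Append the event still being collected when the loop ends;
        #otherwise the last event in the file would be dropped
        if rowNumber>1:
          currentItem=self.mdbuffer[currentrow]
          propertyJson["serviceId"]=currentItem[2].decode("gbk")
          propertyJson[currentItem[3].decode("gbk")]=currentItem[4].decode("gbk")
          dataList.append(propertyJson)
      except Exception,e:
        logging.error("Get Data Error: %s",e)
        print "Get Data Error:",e
      return dataList
    except Exception,e:
      logging.error("Reading Data TO Dic Error: %s",e)
      print "Reading Data TO Dic Error:",e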

  def getAllServiceId(self):
    try:
      dataList=self.readDataToList()
      serList=[item["serviceId"] for item in dataList if item["serviceId"] ]
      serList=list(set(serList))
      return serList
    except Exception,e:
      logging.error("Create ServiceId List Error: %s",e)
      print "Create ServiceId List Error:",e

  def oupPutData(self):
    try:
      dataList=self.readDataToList()
      for item in dataList:
          print "{"
          for key,val in item.items():
            print key,":",val
          print "}"
          print "#"*50
    except Exception,e:
      logging.error("OutPut Data Error: %s",e)
      print "OutPut Data Error:",e

  def createDataDic(self):
    try:
      dataDic={}

      dataList=self.readDataToList()
      count=0
      for item in dataList:
        if item["serviceId"]==u"pageview":
          count+=1
      print count

      serviceIdList=self.getAllServiceId()
      if len(serviceIdList)>0 and len(dataList)>0:
        for serviceId in serviceIdList:
          sameServiceidJosnList=[]
          for item in dataList:
            itemServiceId=item["serviceId"]
            if itemServiceId:
              if serviceId==itemServiceId:
                sameServiceidJosnList.append(item)
            else:
              print "ServiceId is null"
          dataDic[serviceId]=sameServiceidJosnList 

      else:
        print "serviceIdList or dataList is null"
      return dataDic
      '''
      for key,val in dataDic.items():
        print key,len(val)
        print "*"*50
        for item in val:
          print "{"
          for ke,va in item.items():
            print ke,":",va
          print "}"
        print "-"*50
      '''
    except Exception,e:
      print "Create Data Dictionary Error:",e 

def test():
  gen =GenExceptData()
  gen.oupPutData()

if __name__=="__main__":
  test()
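
The note inside readDataToList about propertyJson.clear() versus propertyJson={} can be checked with a small experiment. The block below is a minimal sketch written for Python 3 (unlike the Python 2 script above); the function names collect_with_clear and collect_with_rebind and the sample pairs are made up for illustration only.

```python
# Illustrative only: these helpers are not part of the original script.

def collect_with_clear(pairs):
    """Reuse one dict and clear() it after each append (the problematic variant)."""
    result = []
    current = {}
    for key, value in pairs:
        current[key] = value
        result.append(current)  # stores a reference to the dict, not a copy
        current.clear()         # empties the very dict that was just stored
    return result


def collect_with_rebind(pairs):
    """Bind the name to a fresh dict after each append (what the script does)."""
    result = []
    current = {}
    for key, value in pairs:
        current[key] = value
        result.append(current)
        current = {}            # new object; the stored dict is left untouched
    return result


pairs = [("a", "1"), ("b", "2")]
with_clear = collect_with_clear(pairs)
with_rebind = collect_with_rebind(pairs)

print(with_clear)                      # [{}, {}]
print(with_clear[0] is with_clear[1])  # True: both entries are the same object
print(with_rebind)                     # [{'a': '1'}, {'b': '2'}]
```

With clear(), the list ends up holding several references to one dictionary, so whatever that dictionary contains last is what every entry shows. Rebinding gives each appended record its own object.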

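For readers on Python 3, the grouping idea of the script can be sketched more compactly. This is not the author's code, only a sketch under stated assumptions: the column layout (0 = tracking event, 2 = serviceId, 3 = property name, 4 = property value) is inferred from the indices the script uses, and the sample rows and the names rows_to_records and group_by_service_id are hypothetical.

```python
import csv
import io
from collections import defaultdict
from itertools import groupby

# Hypothetical sample standing in for the real csv file.
# Assumed columns: 0 = event, 2 = serviceId, 3 = property name, 4 = property value.
SAMPLE = """event,unused,serviceId,property,value
e1,x,pageview,url,/home
e1,x,pageview,title,Home
e2,x,click,button,buy
e3,x,pageview,url,/cart
"""


def rows_to_records(rows):
    """Merge consecutive rows that share (event, serviceId) into one dict."""
    records = []
    for (_, service_id), group in groupby(rows, key=lambda r: (r[0], r[2])):
        record = {"serviceId": service_id}  # a fresh dict for every record
        for row in group:
            record[row[3]] = row[4]
        records.append(record)
    return records


def group_by_service_id(records):
    """Map each serviceId to the list of records that carry it."""
    grouped = defaultdict(list)
    for record in records:
        grouped[record["serviceId"]].append(record)
    return dict(grouped)


reader = csv.reader(io.StringIO(SAMPLE))
next(reader)  # skip the header row, as the script does by starting at index 1
records = rows_to_records(list(reader))
by_service = group_by_service_id(records)

print(records)
print(sorted(by_service))
```

Because groupby closes each group on its own, the last event in the file is included without a separate step after the loop. For a real file, the reader would presumably be built from open(path, newline="", encoding="gbk"), which takes the place of the per-field decode("gbk") calls in the Python 2 script; the gbk encoding is assumed from those calls.
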
That is the whole script for loading CSV data into list and dictionary variables in Python; hopefully it serves as a useful reference.
