Problem description: I recently started planning an intelligent chatbot, similar to the intelligent customer service on JD or Taobao, mainly for questions and answers. There are currently 1 million (100w) question-and-answer records in MySQL. When users ask a question, they h...
Recently I have wanted to process a table for analysis. Because I lack experience with SQL and CSV, I have run into the following problems and would appreciate help solving them. Question 1: companyID fresh 23865 13-2 23866 10-3 23867 8-...
Asking the experts for help. Original data: 1 192.168.106.10 1 192.168.106.16 3 192.168.106.11 2 192.168.106.3 2 192.168.106.8 2 192.168.106.6 How can I regroup these pairs and return them as a dictionary in the following format: {1: [192.168.106....
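A minimal sketch of one way to do the grouping, assuming each record is a "key IP" pair on its own line and that the truncated target format maps each key to a list of its IP strings. The names `raw` and `group_by_key` are hypothetical, introduced only for illustration.

```python
from collections import defaultdict

# Sample input copied from the question, assumed to be one "key ip" pair per line.
raw = """1 192.168.106.10
1 192.168.106.16
3 192.168.106.11
2 192.168.106.3
2 192.168.106.8
2 192.168.106.6"""


def group_by_key(text):
    """Collect the IPs that share the same leading key into one list per key."""
    groups = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, ip = line.split()
        groups[int(key)].append(ip)
    return dict(groups)


grouped = group_by_key(raw)
print(grouped)
```

`defaultdict(list)` avoids checking whether a key already exists, and the IPs under each key keep their original input order.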
In pandas, how can I query the rows of DataFrame B according to the IDs in DataFrame A? A and B have the same ID column. This is similar to a column-level subquery in SQL: select * from t_class where c_id in (select c_class_id from t_student). A and B have an o...
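One common way to express an `IN (subquery)` filter in pandas is `Series.isin`. The sketch below assumes both frames share a column literally named `ID`; the sample data and the `value` column are made up for illustration.

```python
import pandas as pd

# Hypothetical sample frames; only the shared "ID" column comes from the question.
a = pd.DataFrame({"ID": [1, 2, 3]})
b = pd.DataFrame({"ID": [2, 3, 4, 5], "value": ["x", "y", "z", "w"]})

# Keep the rows of B whose ID appears anywhere in A,
# roughly: SELECT * FROM b WHERE ID IN (SELECT ID FROM a)
matched = b[b["ID"].isin(a["ID"])]
print(matched)
```

If columns from A are also needed in the result, `b.merge(a, on="ID")` would be the join-style alternative, though it duplicates rows when A contains repeated IDs.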
from pyecharts import Pie attr = ["2dlots", "3d"] v1 = [12, 5] pie = Pie("Chinese 2017 movie visual type ratio", title_pos="left", title_text_size=15, width=1000, height=350) pie.add("", attr, v1, center=[50, 60], radius=[0, 80], is_label_show=Tr...
How can a data analysis project be versioned effectively? Which parts require version control and which do not? How should the charts generated in a data analysis project be managed? ...
Suppose a test paper has 100 questions, each worth 1 point, for a total of 100 points. Each question has four options (all are single-choice questions). Many people have now taken this paper, and we have all of their answers and scores. Note: we do not k...
Match the add column of Table 1 against the key_word column of Table 2: if a Table 1 add value contains a Table 2 key_word, write the corresponding group into column 4 of Table 1; if not, mark it "not included". PS: I have tried iterating, and I have also tried a Cartesian product, but neither achieved the desired ...
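A minimal sketch of the substring match described above, assuming Table 2 is small enough to scan for every row and that the first matching key_word wins. The column names `add`, `key_word`, and `group` come from the question; the sample rows and the helper `find_group` are hypothetical, and the sample Table 1 has only one column, so the result lands in a new `group` column rather than literally in column 4.

```python
import pandas as pd

# Hypothetical sample tables; only the column names come from the question.
table1 = pd.DataFrame(
    {"add": ["12 Oak Street, Springfield", "5 Pine Road, Shelbyville", "unknown place"]}
)
table2 = pd.DataFrame(
    {"key_word": ["Springfield", "Shelbyville"], "group": ["G1", "G2"]}
)

# Materialize the (key_word, group) pairs once so each row only loops over a list.
pairs = list(zip(table2["key_word"], table2["group"]))


def find_group(text, keyword_pairs):
    """Return the group of the first key_word contained in text, else 'not included'."""
    for key_word, group in keyword_pairs:
        if key_word in text:
            return group
    return "not included"


table1["group"] = table1["add"].apply(lambda text: find_group(text, pairs))
print(table1)
```

This avoids building the full Cartesian product as a DataFrame, although the work is still proportional to rows times keywords.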
The question is: how do I set the chunksize and shape of a dask.array? The code is as follows: >>> x = data.values >>> x dask.array<values, shape=(nan, 24), dtype=int64, chunksize=(nan, 24)> >>> y = data["label"].values >...
My goal is to obtain an array of latitude and longitude coordinates covering the whole country, or a province or city, with the coordinates spaced 3 kilometers apart. (In other words, draw circles with a radius of 3 km, approximately filling th...
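A rough sketch of generating such a grid, assuming a rectangular bounding box and a square grid with about 3 km between neighbouring points. It uses the approximation that one degree of latitude is about 111.32 km and that a degree of longitude shrinks by the cosine of the latitude. The function name `grid_points` and the bounding box values are hypothetical; clipping to a real administrative boundary would need a polygon test on top of this.

```python
import math

KM_PER_DEG_LAT = 111.32  # approximate length of one degree of latitude


def grid_points(lat_min, lat_max, lon_min, lon_max, spacing_km=3.0):
    """Return (lat, lon) pairs roughly spacing_km apart inside a bounding box."""
    points = []
    lat_step = spacing_km / KM_PER_DEG_LAT
    lat = lat_min
    while lat <= lat_max:
        # A degree of longitude covers less ground as latitude increases.
        lon_step = spacing_km / (KM_PER_DEG_LAT * math.cos(math.radians(lat)))
        lon = lon_min
        while lon <= lon_max:
            points.append((round(lat, 6), round(lon, 6)))
            lon += lon_step
        lat += lat_step
    return points


# Hypothetical bounding box used only to exercise the function.
points = grid_points(39.8, 40.0, 116.2, 116.5)
print(len(points), points[:3])
```

Circles of radius 3 km centred on a square grid with 3 km spacing overlap heavily; if full coverage with fewer points is the aim, a larger spacing or a hexagonal layout could be considered.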
All four of the methods above can implement the encoding. What is the difference between them? The scripts obviously differ in length; I can already see that. get_dummies method: df7 = DataFrame({"key": list("bbacab"), ...
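For reference, a minimal sketch of the get_dummies approach, using only the `key` column visible in the excerpt (the remaining columns of `df7` are truncated in the question and are not reproduced here). The names `df` and `dummies` are illustrative.

```python
import pandas as pd

# Only the "key" column is taken from the question; the rest of df7 is elided there.
df = pd.DataFrame({"key": list("bbacab")})

# One indicator column per distinct value, named key_a, key_b, key_c.
dummies = pd.get_dummies(df["key"], prefix="key")
print(dummies)
```

Each row has exactly one indicator set, in the column matching that row's `key` value. Depending on the pandas version, the indicators are booleans or 0/1 integers.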