Efficiency of massive data deletion in MongoDB

There are two collections:
data stores the main records, and
attach stores the attachments.
data and attach have a one-to-many relationship.
The ratio of data to attach is about 1:10 or higher; under a single data record there may be 10-50 attach records.
When data holds tens of millions of documents, attach can hold hundreds of millions.

The problem is that when a user deletes data, the corresponding attach records must be deleted as well.
Manipulating hundreds of millions of documents is a long process and puts a lot of pressure on the database.
So I am considering a soft delete: first update the status of the data records to "deleted" and ignore attach, then have a background job physically clear the attach records later.
After all, updating tens of millions of data records does not take long.
But there is a problem: attach is used for statistics. For example, before the deletion the system reports that the user's attachments occupy 20 GB of space; after the deletion you have to report the remaining amount, otherwise billing is inaccurate.
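
A minimal sketch of that soft-delete flow in the mongo shell, assuming hypothetical field names (status on data, dataId and a parent reference on attach) and a placeholder someUserId; adjust to the real schema:

    // Fast path when the user deletes: only flip the status flag on data.
    // someUserId is a placeholder for the requesting user's id.
    db.data.updateMany(
      { userId: someUserId, status: "active" },
      { $set: { status: "deleted", deletedAt: new Date() } }
    )

    // Background job, run later in small batches: find soft-deleted data
    // records and physically remove their attach documents.
    db.data.find({ status: "deleted" }, { _id: 1 }).limit(1000).forEach(function (doc) {
      db.attach.deleteMany({ dataId: doc._id })
      db.data.deleteOne({ _id: doc._id })
    })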

So how can this problem be solved?

Aug.07,2021

This is a very common problem. In big-data scenarios, soft deletion is often used instead of physical deletion, so there is nothing wrong with the approach itself; the first step (marking the records as deleted) is a must. If you know anything about GridFS, a record in GridFS's fs.files is roughly equivalent to your data, while fs.chunks is equivalent to your attach. fs.files carries a series of metadata such as file size, file name, path, and so on.
Now that you have marked the records for deletion, you can simply filter on the deletion flag when computing the total size. For example, if you mark deletions with isDeleted=true, the query you need looks like this:

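A minimal sketch, assuming the deletion flag and a size field (in bytes) both live on the attachment documents, and using a placeholder someUserId; field names are assumptions, adjust to your schema:

    // Total remaining attachment size for one user, skipping soft-deleted records.
    db.attach.aggregate([
      { $match: { userId: someUserId, isDeleted: { $ne: true } } },
      { $group: { _id: null, totalBytes: { $sum: "$size" } } }
    ])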

The solution I can think of now is to calculate the total attach size for each data record and store it on data; after all, the size is fixed once the attachments are written.
Then the attachment statistics are computed from data, which is much faster over tens of millions of documents than over hundreds of millions, and whether the attach records have been physically deleted yet does not affect the result.
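
A minimal sketch of that idea, assuming a hypothetical attachSize field (in bytes) maintained on each data document and a placeholder someUserId:

    // When an attachment is written, accumulate its size on the parent data record.
    db.data.updateOne({ _id: dataId }, { $inc: { attachSize: uploadedBytes } })

    // Billing query: sum over data only, skipping soft-deleted records.
    db.data.aggregate([
      { $match: { userId: someUserId, status: { $ne: "deleted" } } },
      { $group: { _id: null, totalBytes: { $sum: "$attachSize" } } }
    ])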

---

I can think of another plan. Every data record uses a uid to identify which user it belongs to. Each time the user performs an "empty everything" operation, I generate a new uid for that user, so the existing data is instantly disconnected from them. This uid is not the primary key, just a temporary identification id
that is randomly regenerated each time. Each delete operation is recorded along with the old uid, and a background job then cleans up the orphaned data through the old uid.
This way every delete operation responds quickly and the user does not have to wait. However, it means the relationships across the whole database have to be re-examined, which is indeed a lot of work.
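
A minimal sketch of the uid-rotation idea, with hypothetical collection and field names (users.currentUid on the user document, a deleted_uids queue for the background job) and a placeholder someUserId:

    // "Empty everything": rotate the user's uid and queue the old one for cleanup.
    var oldUid = db.users.findOne({ _id: someUserId }).currentUid
    var newUid = ObjectId()   // any random, unique value works here
    db.users.updateOne({ _id: someUserId }, { $set: { currentUid: newUid } })
    db.deleted_uids.insertOne({ uid: oldUid, queuedAt: new Date() })

    // Background job: physically remove the orphaned documents, one old uid at a time.
    db.deleted_uids.find().forEach(function (job) {
      db.attach.deleteMany({ uid: job.uid })
      db.data.deleteMany({ uid: job.uid })
      db.deleted_uids.deleteOne({ _id: job._id })
    })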
