axondendriteplus commited on
Commit
366eeed
·
verified ·
1 Parent(s): b921fe1

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": true,
4
+ "pooling_mode_mean_tokens": false,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md CHANGED
@@ -1,3 +1,1025 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
+ tags:
6
+ - sentence-transformers
7
+ - sentence-similarity
8
+ - feature-extraction
9
+ - generated_from_trainer
10
+ - dataset_size:1456
11
+ - loss:MatryoshkaLoss
12
+ - loss:MultipleNegativesRankingLoss
13
+ base_model: Snowflake/snowflake-arctic-embed-m-v2.0
14
+ widget:
15
+ - source_sentence: 'relation to a portfolio manager being a body corporate, shall
16
+ be construed with reference to : (i) the definition of control in terms of Regulation
17
+ 2(1)(e) of SEBI (Substantial Acquisition of Shares and Takeovers) Regulations,
18
+ 2011 as amended from time to time, if its shares are listed on any recognized
19
+ stock exchange; (ii) in any other case, change in the controlling interest in
20
+ the body corporate; Explanation. For the purpose of sub-clause (ii), the expression
21
+ controlling interest means, (A) an interest, whether direct or indirect, to the
22
+ extent of at least fifty-one percent of voting rights in the body corporate; (B)
23
+ right to appoint majority of the directors or to control the management directly
24
+ or indirectly. Page 3 of 78 reference to the definition of control in terms of
25
+ regulations framed under clause (h) of sub-section (2) of section 11 of the Act;
26
+ (B) if its shares are not listed on any recognised stock exchange, shall be construed
27
+ with reference to the definition of control as provided in sub-section (27) of
28
+ Section 2 of the Companies Act, 2013 (18 of 2013);] (f) chartered accountant"
29
+ means a chartered accountant as defined in clause (b) of sub- section (1) of section
30
+ 2 of the Chartered Accountants Act, 1949 (38 of 1949) and who has obtained a certificate
31
+ of practice under sub-section (1) of section 6 of that Act; 5[(fa) Co-investment
32
+ Portfolio Manager means a Portfolio Manager who is a Manager of a Category I or
33
+ Category II Alternative Investment Fund(s); and: (i) provides services only to
34
+ the investors of such Category I or Category II Alternative Investment Fund(s);
35
+ and (ii) makes investment only in unlisted securities of investee companies where
36
+ such Category I or Category II Alternative Investment Fund(s) make investments:
37
+ Provided that the Co-investment Portfolio Manager may provide services to investors
38
+ from any other Category I or Category II Alternative Investment Fund(s) which
39
+ are managed by them and are also sponsored by the same Sponsor(s);] (g) discretionary
40
+ portfolio manager means a portfolio manager who under a contract relating to portfolio
41
+ management, exercises or may exercise, any degree of discretion as to the investment
42
+ of funds or management of the portfolio of securities of the client, as the case
43
+ may be; (h) eligible fund manager shall have the same meaning as assigned to it
44
+ in sub-section (4) of Section 9A of the Income-tax Act, 1961; (i) eligible investment
45
+ fund shall have the same meaning as assigned to it in sub-section (3) of Section
46
+ 9A of the Income-tax Act, 1961; 5 Inserted by the Securities and Exchange Board
47
+ of India (Portfolio Managers) (Fourth Amendment) Regulations, 2021 w. Page 4 of
48
+ 78 (j) "form" means a form specified in Schedule I; (k) goods means the goods
49
+ notified by the Central Government under clause (bc) of section 2 of the Securities
50
+ Contracts (Regulation) Act, 1956 and forming the underlying of any commodity derivative;
51
+ (l) "inspecting authority" means one or more persons appointed by the Board to
52
+ exercise powers conferred under Chapter V; 6[(la) large value accredited investor
53
+ means an accredited investor who has entered into an agreement with the portfolio
54
+ manager for a minimum investment amount of ten crore rupees;] 7[(lb) investee
55
+ company shall have the same meaning as assigned to it in clause (o) of sub- regulation
56
+ (1) of regulation 2 of the Securities and Exchange Board of India (Alternative
57
+ Investment Funds) Regulations, 2012; (lc) Manager shall'
58
+ sentences:
59
+ - What are the particulars of units of the scheme and/or shares and debentures of
60
+ the company issued for consideration other than cash?
61
+ - What is the maximum period allowed for a listed company to increase its public
62
+ shareholding to twenty-five percent after it falls below that threshold as a result
63
+ of an approved resolution plan under the Insolvency and Bankruptcy Code, 2016?
64
+ - What is the definition of "controlling interest" in the context of a body corporate
65
+ under the SEBI regulations?
66
+ - source_sentence: 'The issuer making a private placement of debt securities and non-convertible
67
+ redeemable preference shares and seeking listing thereof on a recognised stock
68
+ exchange shall make the following disclosures in the placement memorandum: (a)
69
+ disclosures specified in 28[Schedule I] of these regulations; (b) disclosures
70
+ specified in the Companies Act, 2013 (18 of 2013), as applicable; (c) additional
71
+ disclosures as may be specified by the Board. (2) The disclosures as provided
72
+ in sub-regulation (1) shall be made on the websites of stock exchange(s) where
73
+ such securities are proposed to be listed and shall be available for download
74
+ in PDF or any other format as may be specified by the Board. (3) The issuer shall
75
+ ensure that the audited financial statements contained in the placement memorandum
76
+ 29[] shall not be more than six months old from the date of filing placement memorandum
77
+ or the issue opening date, as applicable: 27 Substituted by the by the Securities
78
+ and Exchange Board of India (Issue and Listing of Non-Convertible Securities)
79
+ (Amendment) Regulations, 2024 w. Prior to substitution, it read as The debenture
80
+ trustee shall submit a due diligence certificate to the stock exchange: (a) in
81
+ case of secured debt securities, in the format as specified in Schedule IV of
82
+ these regulations; and (b) in case of unsecured debt securities, in the format
83
+ as specified in Schedule IVA of these regulations. Prior to this, it was substituted
84
+ by the Securities and Exchange Board of India (Issue and Listing of Non-Convertible
85
+ Securities) (Amendment) Regulations, 2022, w. Prior to substitution, sub-regulation
86
+ 3 read as: Debenture trustee shall submit a due diligence certificate to the stock
87
+ exchange in the format as specified in Schedule IV of these regulations. 28 Substituted
88
+ by the Securities and Exchange Board of India (Issue and Listing of Non-Convertible
89
+ Securities) (Second Amendment) Regulations, 2023, w. Prior to substitution, the
90
+ words were Schedule II. 29 Omitted by the Securities and Exchange Board of India
91
+ (Issue and Listing of Non-Convertible Securities) (Second Amendment) Regulations,
92
+ 2023, w. Prior to omission, the words were and tranche placement memorandum. Page
93
+ 26 of 79 Provided that in case of: (a) listed issuers (whose non-convertible securities
94
+ or specified securities are listed on recognised stock exchange(s)), who are in
95
+ compliance with the listing regulations; (b) the issuers of non-convertible securities,
96
+ who are subsidiaries of entities who have listed their specified securities, and
97
+ are in compliance with the listing regulations, instead of audited financial statements
98
+ for the stub period, they may disclose unaudited financial information for such
99
+ period in the format as prescribed in the listing regulations with limited review
100
+ report, as filed with the stock exchange(s), subject to necessary disclosures
101
+ in this regard in the placement memorandum including risk factors. Allotment of
102
+ securities 46. The issuer shall ensure allotment of debt securities and non-convertible
103
+ redeemable preference shares issued on a private placement basis and credit to
104
+ the dematerialised account of the investors, is made within such time as may be
105
+ specified by the Board. PART B ADDITIONAL PROVISIONS FOR LISTING OF DEBT SECURITIES
106
+ ISSUED ON PRIVATE PLACEMENT BASIS Filing of shelf placement memorandum 47. 30[]
107
+ Creation of security 48. (1) While creating a charge or security, the issuer shall
108
+ have the option to create charge or security over the properties or assets (movable,
109
+ immovable, tangible, intangible), shares or any interest thereon, of the issuer
110
+ or its subsidiaries or its holding companies or its associate companies. 30 Omitted
111
+ by the Securities and Exchange Board of India'
112
+ sentences:
113
+ - What is the penalty for a reporting requirement violation under the PIT Regulations
114
+ if there is a delay of more than three months?
115
+ - What are the names of the members of the issuer's audit committee, nomination
116
+ and remuneration committee, and stakeholders relationship committee?
117
+ - What disclosures must an issuer include in the placement memorandum when making
118
+ a private placement of debt securities and non-convertible redeemable preference
119
+ shares?
120
+ - source_sentence: 'Accredited refineries means refineries empanelled by the Stock
121
+ Exchanges; Page 4 of 26 (c) Assayer means a person engaged in the process of assessing
122
+ the purity or quality of gold; (d) Beneficial Owner means a person whose name
123
+ is recorded as such with a depository/depository participant; (e) Board means
124
+ the Securities and Exchange Board of India established under section 3 of the
125
+ Act; 1[(f) Change in control in case of a body corporate (A) if its shares are
126
+ listed on any recognised stock exchange, shall be construed with reference to
127
+ the definition of control in terms of regulations framed under clause (h) of sub-section
128
+ (2) of section 11 of the Act; (B) if its shares are not listed on any recognised
129
+ stock exchange, shall be construed with reference to the definition of control
130
+ as provided in sub-section (27) of Section 2 of the Companies Act, 2013 (18 of
131
+ 2013);] (g) Depositor means a person who owns the gold deposited with the vault
132
+ for creation of Electronic Gold Receipt and its trading on recognized stock exchange;
133
+ (h) Electronic Gold Receipt shall have the meaning assigned to it under the Securities
134
+ Contracts (Regulation) Act, 1956; (i) Gold standard means the purity and standard
135
+ of gold as specified by the recognized stock exchanges; (j) Nominated agencies
136
+ means agencies nominated by the Directorate General of Foreign Trade for import
137
+ of Gold under the Foreign Trade (Development and Regulation) Act, 1992; 1 Substituted
138
+ by the Securities and Exchange Board of India (Change in Control in Intermediaries)
139
+ (Amendment) Regulations, 2023 w. Prior to the substitution, clause (f) read as
140
+ under: Change in control, in relation to a Vault Manager being a body corporate,
141
+ shall be construed with reference to: (i) the definition of control in terms of
142
+ regulation 2(1)(e) of SEBI (Substantial Acquisition of Shares and Takeovers) Regulations,
143
+ 2011 as amended from time to time if its shares are listed on any recognized stock
144
+ exchange; (ii) in any other case, change in the controlling interest in the body
145
+ corporate; Explanation For the purpose of sub-clause (ii), the expression controlling
146
+ interest means- (A) an interest, whether direct or indirect, to the extent of
147
+ at least fifty-one percent of voting rights in the body corporate; or (B) right
148
+ to appoint majority of the directors or to control the management directly or
149
+ indirectly; Page 5 of 26 (k) Recognized vault means the premises encompassing
150
+ strong room(s) set up and managed by the Vault Manager and which conforms with
151
+ all the requirements specified by the Board for the purpose of providing vaulting
152
+ services; (l) Vault Manager means any person who carries on or intends to carry
153
+ on the business of providing vaulting services; (m) Vaulting service in relation
154
+ to gold means the storage and safekeeping of gold deposited with the Vault Manager,
155
+ by the depositor, for the purpose of trading in Electronic Gold Receipt and providing
156
+ services incidental thereto, and includes (i) utilizing the services of assayers
157
+ empanelled with the Stock Exchanges for testing as per the gold standard, wherever
158
+ required; (ii) coordination with depositories for creation, transfer and extinguishment
159
+ of Electronic Gold Receipt; and (iii) providing deposit, storage and withdrawal
160
+ services to the beneficial owners. (2) The words and expressions used and not
161
+ defined in these regulations, but defined in the Act, the Securities Contracts
162
+ (Regulation) Act, 1956, (42 of 1956), the Companies Act, 2013 (18 of 2013), the
163
+ Depositories Act, 1996, or any rules'
164
+ sentences:
165
+ - What happens if the eighth day, as reckoned under the Negotiable Instruments Act,
166
+ is itself a public holiday?
167
+ - What is the role of an Assayer according to the context provided?
168
+ - What disclosures are required regarding significant income sources that constitute
169
+ more than 10% of total income?
170
+ - source_sentence: 'Regulations, 2023 w. 93 Inserted by the Securities and Exchange
171
+ Board of India (Buy-Back of Securities) (Second Amendment) Regulations, 2024 w.
172
+ Particulars Content Public Announcement i) The Public announcement shall be dated
173
+ and signed on behalf of the Board of Directors of the company by its manager or
174
+ secretary, if any, and by not less than two directors of the company one of whom
175
+ shall be a managing director where there is one. ii) A full and complete disclosure
176
+ of all material facts including the disclosures mentioned in Schedule I. iii)
177
+ In addition to the disclosures in Schedule A, the following disclosures shall
178
+ be made: i) Date of shareholders approval for buy-back, if applicable; ii) Minimum
179
+ and maximum number of securities that the company proposes to buy-back, sources
180
+ of funds from which the buy-back would be made and the cost of financing the buy-back;
181
+ iii) Proposed time table from opening of offer till the extinguishment of the
182
+ certificates; iv) Process and methodology to be adopted for the buy- back; v)
183
+ Brief information about the company; i) The Public announcement shall be dated
184
+ and signed on behalf of the Board of Directors of the company by its manager or
185
+ secretary, if any, and by not less than two directors of the company one of whom
186
+ shall be a managing director where there is one. ii) A full and complete disclosure
187
+ of all material facts including the disclosures mentioned in Schedule I. Page
188
+ 46 of 51 SCHEDULE - IV [Regulation 16(iv)(b)] Public Announcement for Open Market
189
+ Buy-Back through Stock Exchange Particulars Content Public Announcement i) The
190
+ Public announcement shall be dated and signed on behalf of the Board of Directors
191
+ of the company by its manager or secretary, if any, and by not less than two directors
192
+ of the company one of whom shall be a managing director where there is one. ii)
193
+ A full and complete disclosure of all material facts including the disclosures
194
+ mentioned in Schedule I. iii) In addition to the disclosures in Schedule A, the
195
+ following disclosures shall be made: i) Date of shareholders approval for buy-back,
196
+ if applicable; ii) Minimum and maximum number of securities that the company proposes
197
+ to buy-back, sources of funds from which the buy-back would be made and the cost
198
+ of financing the buy-back; iii) Proposed time table from opening of offer till
199
+ the extinguishment of the certificates; iv) Process and methodology to be adopted
200
+ for the buy- back; v) Brief information about the company; Particulars Content
201
+ vi) Audited Financial information for the last 3 years and the lead manager shall
202
+ ensure that the particulars (audited statement and un-audited statement) contained
203
+ therein shall not be more than more than 6 months old from the date of the public
204
+ announcement together with financial ratios as may be specified by the Board;
205
+ Explanation: Ensure that the un-audited financial results, if any disclosed, should
206
+ be certified / limited review by statutory auditors. vii) Details of escrow account
207
+ opened and the amount deposited therein; viii) Listing details and stock market
208
+ data: a) high, low and average market prices of the securities of the company
209
+ proposed to be bought back, during the preceding three years; b) monthly high
210
+ and low prices for the six months preceding the date of the public announcement;
211
+ c) the number of securities traded on the days when the high and low prices were
212
+ recorded on the relevant stock exchanges during the period stated at (a) and (b)
213
+ above; d) the stock market data referred to above shall be shown separately for
214
+ periods marked by a change in capital structure, with such period commencing from
215
+ the date the concerned stock exchange recognises the change in the capital structure.
216
+ when the securities have become ex-rights or ex-bonus) ; e)'
217
+ sentences:
218
+ - What is the time frame within which a return of allotment of securities must be
219
+ filed with the Registrar according to the Companies (Registration Offices and
220
+ Fees) Rules, 2014?
221
+ - What is the deadline for a listed entity to put in place systems and processes
222
+ for compliance with clause (f) of sub-regulation (2) of regulation 34 after it
223
+ is required to comply for the first time?
224
+ - What are the specific disclosures required in a public announcement for a buy-back
225
+ of securities according to the Securities and Exchange Board of India regulations?
226
+ - source_sentence: 1996. In section 19 of the Depositories Act, 1996 (hereafter in
227
+ this chapter referred to as the principal Act in this chapter), the following
228
+ Explanation shall be inserted, namely:- Explanation. For the removal of doubts,
229
+ it is hereby declared that power to issue directions under this section shall
230
+ include and always be deemed to have been included the power to direct any person,
231
+ who made profit or averted loss by indulging in any transaction or activity in
232
+ contravention of the provisions of this Act or regulations made thereunder, to
233
+ disgorge an amount equivalent to the wrongful gain made or loss averted by such
234
+ contravention.
235
+ sentences:
236
+ - What are the qualifications required for a judge to be appointed to a Special
237
+ Court under this Act?
238
+ - What is the purpose of the Explanation inserted in section 19 of the Depositories
239
+ Act, 1996?
240
+ - What is the minimum percentage of total issued shares that the acquirer must reach
241
+ in order for the delisting offer to be considered successful?
242
+ pipeline_tag: sentence-similarity
243
+ library_name: sentence-transformers
244
+ metrics:
245
+ - cosine_accuracy@1
246
+ - cosine_accuracy@3
247
+ - cosine_accuracy@5
248
+ - cosine_accuracy@10
249
+ - cosine_precision@1
250
+ - cosine_precision@3
251
+ - cosine_precision@5
252
+ - cosine_precision@10
253
+ - cosine_recall@1
254
+ - cosine_recall@3
255
+ - cosine_recall@5
256
+ - cosine_recall@10
257
+ - cosine_ndcg@10
258
+ - cosine_mrr@10
259
+ - cosine_map@100
260
+ model-index:
261
+ - name: BGE base Financial Matryoshka
262
+ results:
263
+ - task:
264
+ type: information-retrieval
265
+ name: Information Retrieval
266
+ dataset:
267
+ name: dim 768
268
+ type: dim_768
269
+ metrics:
270
+ - type: cosine_accuracy@1
271
+ value: 0.4691358024691358
272
+ name: Cosine Accuracy@1
273
+ - type: cosine_accuracy@3
274
+ value: 0.7469135802469136
275
+ name: Cosine Accuracy@3
276
+ - type: cosine_accuracy@5
277
+ value: 0.845679012345679
278
+ name: Cosine Accuracy@5
279
+ - type: cosine_accuracy@10
280
+ value: 0.9135802469135802
281
+ name: Cosine Accuracy@10
282
+ - type: cosine_precision@1
283
+ value: 0.4691358024691358
284
+ name: Cosine Precision@1
285
+ - type: cosine_precision@3
286
+ value: 0.24897119341563784
287
+ name: Cosine Precision@3
288
+ - type: cosine_precision@5
289
+ value: 0.1691358024691358
290
+ name: Cosine Precision@5
291
+ - type: cosine_precision@10
292
+ value: 0.09135802469135802
293
+ name: Cosine Precision@10
294
+ - type: cosine_recall@1
295
+ value: 0.4691358024691358
296
+ name: Cosine Recall@1
297
+ - type: cosine_recall@3
298
+ value: 0.7469135802469136
299
+ name: Cosine Recall@3
300
+ - type: cosine_recall@5
301
+ value: 0.845679012345679
302
+ name: Cosine Recall@5
303
+ - type: cosine_recall@10
304
+ value: 0.9135802469135802
305
+ name: Cosine Recall@10
306
+ - type: cosine_ndcg@10
307
+ value: 0.6931888396302245
308
+ name: Cosine Ndcg@10
309
+ - type: cosine_mrr@10
310
+ value: 0.6220483049186752
311
+ name: Cosine Mrr@10
312
+ - type: cosine_map@100
313
+ value: 0.6258770441533618
314
+ name: Cosine Map@100
315
+ - task:
316
+ type: information-retrieval
317
+ name: Information Retrieval
318
+ dataset:
319
+ name: dim 512
320
+ type: dim_512
321
+ metrics:
322
+ - type: cosine_accuracy@1
323
+ value: 0.4506172839506173
324
+ name: Cosine Accuracy@1
325
+ - type: cosine_accuracy@3
326
+ value: 0.7530864197530864
327
+ name: Cosine Accuracy@3
328
+ - type: cosine_accuracy@5
329
+ value: 0.8271604938271605
330
+ name: Cosine Accuracy@5
331
+ - type: cosine_accuracy@10
332
+ value: 0.9074074074074074
333
+ name: Cosine Accuracy@10
334
+ - type: cosine_precision@1
335
+ value: 0.4506172839506173
336
+ name: Cosine Precision@1
337
+ - type: cosine_precision@3
338
+ value: 0.2510288065843621
339
+ name: Cosine Precision@3
340
+ - type: cosine_precision@5
341
+ value: 0.16543209876543208
342
+ name: Cosine Precision@5
343
+ - type: cosine_precision@10
344
+ value: 0.09074074074074073
345
+ name: Cosine Precision@10
346
+ - type: cosine_recall@1
347
+ value: 0.4506172839506173
348
+ name: Cosine Recall@1
349
+ - type: cosine_recall@3
350
+ value: 0.7530864197530864
351
+ name: Cosine Recall@3
352
+ - type: cosine_recall@5
353
+ value: 0.8271604938271605
354
+ name: Cosine Recall@5
355
+ - type: cosine_recall@10
356
+ value: 0.9074074074074074
357
+ name: Cosine Recall@10
358
+ - type: cosine_ndcg@10
359
+ value: 0.6862124164896819
360
+ name: Cosine Ndcg@10
361
+ - type: cosine_mrr@10
362
+ value: 0.6141950813247109
363
+ name: Cosine Mrr@10
364
+ - type: cosine_map@100
365
+ value: 0.618139737272647
366
+ name: Cosine Map@100
367
+ - task:
368
+ type: information-retrieval
369
+ name: Information Retrieval
370
+ dataset:
371
+ name: dim 256
372
+ type: dim_256
373
+ metrics:
374
+ - type: cosine_accuracy@1
375
+ value: 0.4382716049382716
376
+ name: Cosine Accuracy@1
377
+ - type: cosine_accuracy@3
378
+ value: 0.7345679012345679
379
+ name: Cosine Accuracy@3
380
+ - type: cosine_accuracy@5
381
+ value: 0.8271604938271605
382
+ name: Cosine Accuracy@5
383
+ - type: cosine_accuracy@10
384
+ value: 0.8950617283950617
385
+ name: Cosine Accuracy@10
386
+ - type: cosine_precision@1
387
+ value: 0.4382716049382716
388
+ name: Cosine Precision@1
389
+ - type: cosine_precision@3
390
+ value: 0.24485596707818924
391
+ name: Cosine Precision@3
392
+ - type: cosine_precision@5
393
+ value: 0.16543209876543208
394
+ name: Cosine Precision@5
395
+ - type: cosine_precision@10
396
+ value: 0.08950617283950617
397
+ name: Cosine Precision@10
398
+ - type: cosine_recall@1
399
+ value: 0.4382716049382716
400
+ name: Cosine Recall@1
401
+ - type: cosine_recall@3
402
+ value: 0.7345679012345679
403
+ name: Cosine Recall@3
404
+ - type: cosine_recall@5
405
+ value: 0.8271604938271605
406
+ name: Cosine Recall@5
407
+ - type: cosine_recall@10
408
+ value: 0.8950617283950617
409
+ name: Cosine Recall@10
410
+ - type: cosine_ndcg@10
411
+ value: 0.6725732937028854
412
+ name: Cosine Ndcg@10
413
+ - type: cosine_mrr@10
414
+ value: 0.6002816970409561
415
+ name: Cosine Mrr@10
416
+ - type: cosine_map@100
417
+ value: 0.6055931590500198
418
+ name: Cosine Map@100
419
+ - task:
420
+ type: information-retrieval
421
+ name: Information Retrieval
422
+ dataset:
423
+ name: dim 128
424
+ type: dim_128
425
+ metrics:
426
+ - type: cosine_accuracy@1
427
+ value: 0.41358024691358025
428
+ name: Cosine Accuracy@1
429
+ - type: cosine_accuracy@3
430
+ value: 0.6851851851851852
431
+ name: Cosine Accuracy@3
432
+ - type: cosine_accuracy@5
433
+ value: 0.7777777777777778
434
+ name: Cosine Accuracy@5
435
+ - type: cosine_accuracy@10
436
+ value: 0.8703703703703703
437
+ name: Cosine Accuracy@10
438
+ - type: cosine_precision@1
439
+ value: 0.41358024691358025
440
+ name: Cosine Precision@1
441
+ - type: cosine_precision@3
442
+ value: 0.22839506172839505
443
+ name: Cosine Precision@3
444
+ - type: cosine_precision@5
445
+ value: 0.15555555555555553
446
+ name: Cosine Precision@5
447
+ - type: cosine_precision@10
448
+ value: 0.08703703703703702
449
+ name: Cosine Precision@10
450
+ - type: cosine_recall@1
451
+ value: 0.41358024691358025
452
+ name: Cosine Recall@1
453
+ - type: cosine_recall@3
454
+ value: 0.6851851851851852
455
+ name: Cosine Recall@3
456
+ - type: cosine_recall@5
457
+ value: 0.7777777777777778
458
+ name: Cosine Recall@5
459
+ - type: cosine_recall@10
460
+ value: 0.8703703703703703
461
+ name: Cosine Recall@10
462
+ - type: cosine_ndcg@10
463
+ value: 0.6396728651848874
464
+ name: Cosine Ndcg@10
465
+ - type: cosine_mrr@10
466
+ value: 0.5658534195571232
467
+ name: Cosine Mrr@10
468
+ - type: cosine_map@100
469
+ value: 0.5718872882660476
470
+ name: Cosine Map@100
471
+ - task:
472
+ type: information-retrieval
473
+ name: Information Retrieval
474
+ dataset:
475
+ name: dim 64
476
+ type: dim_64
477
+ metrics:
478
+ - type: cosine_accuracy@1
479
+ value: 0.345679012345679
480
+ name: Cosine Accuracy@1
481
+ - type: cosine_accuracy@3
482
+ value: 0.5802469135802469
483
+ name: Cosine Accuracy@3
484
+ - type: cosine_accuracy@5
485
+ value: 0.6851851851851852
486
+ name: Cosine Accuracy@5
487
+ - type: cosine_accuracy@10
488
+ value: 0.7901234567901234
489
+ name: Cosine Accuracy@10
490
+ - type: cosine_precision@1
491
+ value: 0.345679012345679
492
+ name: Cosine Precision@1
493
+ - type: cosine_precision@3
494
+ value: 0.19341563786008228
495
+ name: Cosine Precision@3
496
+ - type: cosine_precision@5
497
+ value: 0.13703703703703704
498
+ name: Cosine Precision@5
499
+ - type: cosine_precision@10
500
+ value: 0.07901234567901233
501
+ name: Cosine Precision@10
502
+ - type: cosine_recall@1
503
+ value: 0.345679012345679
504
+ name: Cosine Recall@1
505
+ - type: cosine_recall@3
506
+ value: 0.5802469135802469
507
+ name: Cosine Recall@3
508
+ - type: cosine_recall@5
509
+ value: 0.6851851851851852
510
+ name: Cosine Recall@5
511
+ - type: cosine_recall@10
512
+ value: 0.7901234567901234
513
+ name: Cosine Recall@10
514
+ - type: cosine_ndcg@10
515
+ value: 0.5603350026673091
516
+ name: Cosine Ndcg@10
517
+ - type: cosine_mrr@10
518
+ value: 0.48750489907897315
519
+ name: Cosine Mrr@10
520
+ - type: cosine_map@100
521
+ value: 0.49429085785187665
522
+ name: Cosine Map@100
523
+ ---
524
+
525
+ # BGE base Financial Matryoshka
526
+
527
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Snowflake/snowflake-arctic-embed-m-v2.0](https://huggingface.co/Snowflake/snowflake-arctic-embed-m-v2.0) on the json dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
528
+
529
+ ## Model Details
530
+
531
+ ### Model Description
532
+ - **Model Type:** Sentence Transformer
533
+ - **Base model:** [Snowflake/snowflake-arctic-embed-m-v2.0](https://huggingface.co/Snowflake/snowflake-arctic-embed-m-v2.0) <!-- at revision 95c2741480856aa9666782eb4afe11959938017f -->
534
+ - **Maximum Sequence Length:** 8192 tokens
535
+ - **Output Dimensionality:** 768 dimensions
536
+ - **Similarity Function:** Cosine Similarity
537
+ - **Training Dataset:**
538
+ - json
539
+ - **Language:** en
540
+ - **License:** apache-2.0
541
+
542
+ ### Model Sources
543
+
544
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
545
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
546
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
547
+
548
+ ### Full Model Architecture
549
+
550
+ ```
551
+ SentenceTransformer(
552
+ (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: GteModel
553
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
554
+ (2): Normalize()
555
+ )
556
+ ```
557
+
558
+ ## Usage
559
+
560
+ ### Direct Usage (Sentence Transformers)
561
+
562
+ First install the Sentence Transformers library:
563
+
564
+ ```bash
565
+ pip install -U sentence-transformers
566
+ ```
567
+
568
+ Then you can load this model and run inference.
569
+ ```python
570
+ from sentence_transformers import SentenceTransformer
571
+
572
+ # Download from the 🤗 Hub
573
+ model = SentenceTransformer("sentence_transformers_model_id")
574
+ # Run inference
575
+ sentences = [
576
+ '1996. In section 19 of the Depositories Act, 1996 (hereafter in this chapter referred to as the principal Act in this chapter), the following Explanation shall be inserted, namely:- Explanation. For the removal of doubts, it is hereby declared that power to issue directions under this section shall include and always be deemed to have been included the power to direct any person, who made profit or averted loss by indulging in any transaction or activity in contravention of the provisions of this Act or regulations made thereunder, to disgorge an amount equivalent to the wrongful gain made or loss averted by such contravention.',
577
+ 'What is the purpose of the Explanation inserted in section 19 of the Depositories Act, 1996?',
578
+ 'What are the qualifications required for a judge to be appointed to a Special Court under this Act?',
579
+ ]
580
+ embeddings = model.encode(sentences)
581
+ print(embeddings.shape)
582
+ # [3, 768]
583
+
584
+ # Get the similarity scores for the embeddings
585
+ similarities = model.similarity(embeddings, embeddings)
586
+ print(similarities.shape)
587
+ # [3, 3]
588
+ ```
589
+
590
+ <!--
591
+ ### Direct Usage (Transformers)
592
+
593
+ <details><summary>Click to see the direct usage in Transformers</summary>
594
+
595
+ </details>
596
+ -->
597
+
598
+ <!--
599
+ ### Downstream Usage (Sentence Transformers)
600
+
601
+ You can finetune this model on your own dataset.
602
+
603
+ <details><summary>Click to expand</summary>
604
+
605
+ </details>
606
+ -->
607
+
608
+ <!--
609
+ ### Out-of-Scope Use
610
+
611
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
612
+ -->
613
+
614
+ ## Evaluation
615
+
616
+ ### Metrics
617
+
618
+ #### Information Retrieval
619
+
620
+ * Dataset: `dim_768`
621
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
622
+ ```json
623
+ {
624
+ "truncate_dim": 768
625
+ }
626
+ ```
627
+
628
+ | Metric | Value |
629
+ |:--------------------|:-----------|
630
+ | cosine_accuracy@1 | 0.4691 |
631
+ | cosine_accuracy@3 | 0.7469 |
632
+ | cosine_accuracy@5 | 0.8457 |
633
+ | cosine_accuracy@10 | 0.9136 |
634
+ | cosine_precision@1 | 0.4691 |
635
+ | cosine_precision@3 | 0.249 |
636
+ | cosine_precision@5 | 0.1691 |
637
+ | cosine_precision@10 | 0.0914 |
638
+ | cosine_recall@1 | 0.4691 |
639
+ | cosine_recall@3 | 0.7469 |
640
+ | cosine_recall@5 | 0.8457 |
641
+ | cosine_recall@10 | 0.9136 |
642
+ | **cosine_ndcg@10** | **0.6932** |
643
+ | cosine_mrr@10 | 0.622 |
644
+ | cosine_map@100 | 0.6259 |
645
+
646
+ #### Information Retrieval
647
+
648
+ * Dataset: `dim_512`
649
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
650
+ ```json
651
+ {
652
+ "truncate_dim": 512
653
+ }
654
+ ```
655
+
656
+ | Metric | Value |
657
+ |:--------------------|:-----------|
658
+ | cosine_accuracy@1 | 0.4506 |
659
+ | cosine_accuracy@3 | 0.7531 |
660
+ | cosine_accuracy@5 | 0.8272 |
661
+ | cosine_accuracy@10 | 0.9074 |
662
+ | cosine_precision@1 | 0.4506 |
663
+ | cosine_precision@3 | 0.251 |
664
+ | cosine_precision@5 | 0.1654 |
665
+ | cosine_precision@10 | 0.0907 |
666
+ | cosine_recall@1 | 0.4506 |
667
+ | cosine_recall@3 | 0.7531 |
668
+ | cosine_recall@5 | 0.8272 |
669
+ | cosine_recall@10 | 0.9074 |
670
+ | **cosine_ndcg@10** | **0.6862** |
671
+ | cosine_mrr@10 | 0.6142 |
672
+ | cosine_map@100 | 0.6181 |
673
+
674
+ #### Information Retrieval
675
+
676
+ * Dataset: `dim_256`
677
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
678
+ ```json
679
+ {
680
+ "truncate_dim": 256
681
+ }
682
+ ```
683
+
684
+ | Metric | Value |
685
+ |:--------------------|:-----------|
686
+ | cosine_accuracy@1 | 0.4383 |
687
+ | cosine_accuracy@3 | 0.7346 |
688
+ | cosine_accuracy@5 | 0.8272 |
689
+ | cosine_accuracy@10 | 0.8951 |
690
+ | cosine_precision@1 | 0.4383 |
691
+ | cosine_precision@3 | 0.2449 |
692
+ | cosine_precision@5 | 0.1654 |
693
+ | cosine_precision@10 | 0.0895 |
694
+ | cosine_recall@1 | 0.4383 |
695
+ | cosine_recall@3 | 0.7346 |
696
+ | cosine_recall@5 | 0.8272 |
697
+ | cosine_recall@10 | 0.8951 |
698
+ | **cosine_ndcg@10** | **0.6726** |
699
+ | cosine_mrr@10 | 0.6003 |
700
+ | cosine_map@100 | 0.6056 |
701
+
702
+ #### Information Retrieval
703
+
704
+ * Dataset: `dim_128`
705
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
706
+ ```json
707
+ {
708
+ "truncate_dim": 128
709
+ }
710
+ ```
711
+
712
+ | Metric | Value |
713
+ |:--------------------|:-----------|
714
+ | cosine_accuracy@1 | 0.4136 |
715
+ | cosine_accuracy@3 | 0.6852 |
716
+ | cosine_accuracy@5 | 0.7778 |
717
+ | cosine_accuracy@10 | 0.8704 |
718
+ | cosine_precision@1 | 0.4136 |
719
+ | cosine_precision@3 | 0.2284 |
720
+ | cosine_precision@5 | 0.1556 |
721
+ | cosine_precision@10 | 0.087 |
722
+ | cosine_recall@1 | 0.4136 |
723
+ | cosine_recall@3 | 0.6852 |
724
+ | cosine_recall@5 | 0.7778 |
725
+ | cosine_recall@10 | 0.8704 |
726
+ | **cosine_ndcg@10** | **0.6397** |
727
+ | cosine_mrr@10 | 0.5659 |
728
+ | cosine_map@100 | 0.5719 |
729
+
730
+ #### Information Retrieval
731
+
732
+ * Dataset: `dim_64`
733
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
734
+ ```json
735
+ {
736
+ "truncate_dim": 64
737
+ }
738
+ ```
739
+
740
+ | Metric | Value |
741
+ |:--------------------|:-----------|
742
+ | cosine_accuracy@1 | 0.3457 |
743
+ | cosine_accuracy@3 | 0.5802 |
744
+ | cosine_accuracy@5 | 0.6852 |
745
+ | cosine_accuracy@10 | 0.7901 |
746
+ | cosine_precision@1 | 0.3457 |
747
+ | cosine_precision@3 | 0.1934 |
748
+ | cosine_precision@5 | 0.137 |
749
+ | cosine_precision@10 | 0.079 |
750
+ | cosine_recall@1 | 0.3457 |
751
+ | cosine_recall@3 | 0.5802 |
752
+ | cosine_recall@5 | 0.6852 |
753
+ | cosine_recall@10 | 0.7901 |
754
+ | **cosine_ndcg@10** | **0.5603** |
755
+ | cosine_mrr@10 | 0.4875 |
756
+ | cosine_map@100 | 0.4943 |
757
+
758
+ <!--
759
+ ## Bias, Risks and Limitations
760
+
761
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
762
+ -->
763
+
764
+ <!--
765
+ ### Recommendations
766
+
767
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
768
+ -->
769
+
770
+ ## Training Details
771
+
772
+ ### Training Dataset
773
+
774
+ #### json
775
+
776
+ * Dataset: json
777
+ * Size: 1,456 training samples
778
+ * Columns: <code>positive</code> and <code>anchor</code>
779
+ * Approximate statistics based on the first 1000 samples:
780
+ | | positive | anchor |
781
+ |:--------|:---------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
782
+ | type | string | string |
783
+ | details | <ul><li>min: 110 tokens</li><li>mean: 795.06 tokens</li><li>max: 1042 tokens</li></ul> | <ul><li>min: 13 tokens</li><li>mean: 30.5 tokens</li><li>max: 215 tokens</li></ul> |
784
+ * Samples:
785
+ | positive | anchor |
786
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------|
787
+ | <code>its continued obligations towards the holders of debt securities. We have satisfied ourselves about the ability of the issuer to service the debt securities. PLACE DATE: DEBENTURE TRUSTEE TO THE ISSUE WITH HIS SEAL Page - 65 - of 68 FORMAT OF DUE DILIGENCE CERTIFICATE TO BE GIVEN BY THE DEBENTURE TRUSTEE BEFORE OPENING OF THE ISSUE To, SECURITIES AND EXCHANGE BOARD OF INDIA Dear Sir / Madam, SUB. : ISSUE OF BY (Issuer) We, the Debenture Trustee (s) to the above mentioned forthcoming issue state as follows: (1) We have examined documents pertaining to the said issue and other relevant documents. (2) On the basis of such examination and discussions with the issuer, its Mayor/Deputy Mayor /Directors and other officers, other agencies and independent verification of the various relevant documents,- (a) WE CONFIRM that the issuer has made adequate provisions regarding escrow payment mechanism for repayment of debt obligations, and (b) We have satisfied ourselves about the ability of the iss...</code> | <code>What specific provisions has the issuer made regarding the repayment of debt obligations?</code> |
788
+ | <code>sums realised by way of penalties to Consolidated Fund of India 23L. Appeal to Securities Appellate Tribunal 23M. Offences 23N. Composition of certain offences 23-O. Power to grant immunity 24. Contravention by companies 25. Certain offences to be cognizable 26. Cognizance of offences by courts 26A. Establishment of Special Courts 26B. Offences triable by Special Courts 26C. Appeal and revision 26D. Application of Code to proceedings before Special Court 26E. Transitional Provisions MISCELLANEOUS 27. Title to dividends 27A. Right to receive income from collective investment scheme 27B. Right to receive income from mutual fund 28. Act not to apply in certain cases 29. Protection of action taken in good faith 29A. Power to delegate 29B. Powers of Board not to apply to International Financial Services Centre 30. Power to make rules 30A. Special Provisions related to commodity derivatives 30B. Special provisions related to pooled investment vehicle 31. Power of Securities and Exchange Boar...</code> | <code>What powers does the Securities and Exchange Board of India have to make regulations according to the Securities Contracts (Regulation) Act, 1956?</code> |
789
+ | <code>the depository or the securities market as a result of the default; and (c) the repetitive nature of the default. ] CHAPTER X PROCEDURE FOR ACTION IN CASE OF DEFAULT Liability for action in case of default 92. Without prejudice to the power of the Board to take action, under the provisions of the Act and the Depositories Act, if a depository or a participant:- (a) contravenes any of the provisions of the Act, the Depositories Act, the bye-laws, agreements and these regulations; (b) fails to furnish any information relating to its activity as a depository or participant as required under these regulations; (c) does not furnish the information called for by the Board under clause (a) of sub-section (1) of section 18 of the Depositories Act or furnishes information which is false or misleading in any material particular; (d) does not co-operate in any inspection or investigation or enquiry conducted by the Board; (e) fails to comply with any direction of the Board issued under section 18 ...</code> | <code>What actions can the Board take against a depository or participant that fails to comply with the provisions of the Act or the Depositories Act?</code> |
790
+ * Loss: [<code>MatryoshkaLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#matryoshkaloss) with these parameters:
791
+ ```json
792
+ {
793
+ "loss": "MultipleNegativesRankingLoss",
794
+ "matryoshka_dims": [
795
+ 768,
796
+ 512,
797
+ 256,
798
+ 128,
799
+ 64
800
+ ],
801
+ "matryoshka_weights": [
802
+ 1,
803
+ 1,
804
+ 1,
805
+ 1,
806
+ 1
807
+ ],
808
+ "n_dims_per_step": -1
809
+ }
810
+ ```
811
+
812
+ ### Training Hyperparameters
813
+ #### Non-Default Hyperparameters
814
+
815
+ - `eval_strategy`: epoch
816
+ - `per_device_train_batch_size`: 32
817
+ - `per_device_eval_batch_size`: 16
818
+ - `gradient_accumulation_steps`: 16
819
+ - `learning_rate`: 2e-05
820
+ - `num_train_epochs`: 4
821
+ - `lr_scheduler_type`: cosine
822
+ - `warmup_ratio`: 0.1
823
+ - `bf16`: True
824
+ - `tf32`: True
825
+ - `load_best_model_at_end`: True
826
+ - `optim`: adamw_torch_fused
827
+ - `batch_sampler`: no_duplicates
828
+
829
+ #### All Hyperparameters
830
+ <details><summary>Click to expand</summary>
831
+
832
+ - `overwrite_output_dir`: False
833
+ - `do_predict`: False
834
+ - `eval_strategy`: epoch
835
+ - `prediction_loss_only`: True
836
+ - `per_device_train_batch_size`: 32
837
+ - `per_device_eval_batch_size`: 16
838
+ - `per_gpu_train_batch_size`: None
839
+ - `per_gpu_eval_batch_size`: None
840
+ - `gradient_accumulation_steps`: 16
841
+ - `eval_accumulation_steps`: None
842
+ - `torch_empty_cache_steps`: None
843
+ - `learning_rate`: 2e-05
844
+ - `weight_decay`: 0.0
845
+ - `adam_beta1`: 0.9
846
+ - `adam_beta2`: 0.999
847
+ - `adam_epsilon`: 1e-08
848
+ - `max_grad_norm`: 1.0
849
+ - `num_train_epochs`: 4
850
+ - `max_steps`: -1
851
+ - `lr_scheduler_type`: cosine
852
+ - `lr_scheduler_kwargs`: {}
853
+ - `warmup_ratio`: 0.1
854
+ - `warmup_steps`: 0
855
+ - `log_level`: passive
856
+ - `log_level_replica`: warning
857
+ - `log_on_each_node`: True
858
+ - `logging_nan_inf_filter`: True
859
+ - `save_safetensors`: True
860
+ - `save_on_each_node`: False
861
+ - `save_only_model`: False
862
+ - `restore_callback_states_from_checkpoint`: False
863
+ - `no_cuda`: False
864
+ - `use_cpu`: False
865
+ - `use_mps_device`: False
866
+ - `seed`: 42
867
+ - `data_seed`: None
868
+ - `jit_mode_eval`: False
869
+ - `use_ipex`: False
870
+ - `bf16`: True
871
+ - `fp16`: False
872
+ - `fp16_opt_level`: O1
873
+ - `half_precision_backend`: auto
874
+ - `bf16_full_eval`: False
875
+ - `fp16_full_eval`: False
876
+ - `tf32`: True
877
+ - `local_rank`: 0
878
+ - `ddp_backend`: None
879
+ - `tpu_num_cores`: None
880
+ - `tpu_metrics_debug`: False
881
+ - `debug`: []
882
+ - `dataloader_drop_last`: False
883
+ - `dataloader_num_workers`: 0
884
+ - `dataloader_prefetch_factor`: None
885
+ - `past_index`: -1
886
+ - `disable_tqdm`: False
887
+ - `remove_unused_columns`: True
888
+ - `label_names`: None
889
+ - `load_best_model_at_end`: True
890
+ - `ignore_data_skip`: False
891
+ - `fsdp`: []
892
+ - `fsdp_min_num_params`: 0
893
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
894
+ - `fsdp_transformer_layer_cls_to_wrap`: None
895
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
896
+ - `deepspeed`: None
897
+ - `label_smoothing_factor`: 0.0
898
+ - `optim`: adamw_torch_fused
899
+ - `optim_args`: None
900
+ - `adafactor`: False
901
+ - `group_by_length`: False
902
+ - `length_column_name`: length
903
+ - `ddp_find_unused_parameters`: None
904
+ - `ddp_bucket_cap_mb`: None
905
+ - `ddp_broadcast_buffers`: False
906
+ - `dataloader_pin_memory`: True
907
+ - `dataloader_persistent_workers`: False
908
+ - `skip_memory_metrics`: True
909
+ - `use_legacy_prediction_loop`: False
910
+ - `push_to_hub`: False
911
+ - `resume_from_checkpoint`: None
912
+ - `hub_model_id`: None
913
+ - `hub_strategy`: every_save
914
+ - `hub_private_repo`: None
915
+ - `hub_always_push`: False
916
+ - `gradient_checkpointing`: False
917
+ - `gradient_checkpointing_kwargs`: None
918
+ - `include_inputs_for_metrics`: False
919
+ - `include_for_metrics`: []
920
+ - `eval_do_concat_batches`: True
921
+ - `fp16_backend`: auto
922
+ - `push_to_hub_model_id`: None
923
+ - `push_to_hub_organization`: None
924
+ - `mp_parameters`:
925
+ - `auto_find_batch_size`: False
926
+ - `full_determinism`: False
927
+ - `torchdynamo`: None
928
+ - `ray_scope`: last
929
+ - `ddp_timeout`: 1800
930
+ - `torch_compile`: False
931
+ - `torch_compile_backend`: None
932
+ - `torch_compile_mode`: None
933
+ - `include_tokens_per_second`: False
934
+ - `include_num_input_tokens_seen`: False
935
+ - `neftune_noise_alpha`: None
936
+ - `optim_target_modules`: None
937
+ - `batch_eval_metrics`: False
938
+ - `eval_on_start`: False
939
+ - `use_liger_kernel`: False
940
+ - `eval_use_gather_object`: False
941
+ - `average_tokens_across_devices`: False
942
+ - `prompts`: None
943
+ - `batch_sampler`: no_duplicates
944
+ - `multi_dataset_batch_sampler`: proportional
945
+
946
+ </details>
947
+
948
+ ### Training Logs
949
+ | Epoch | Step | Training Loss | dim_768_cosine_ndcg@10 | dim_512_cosine_ndcg@10 | dim_256_cosine_ndcg@10 | dim_128_cosine_ndcg@10 | dim_64_cosine_ndcg@10 |
950
+ |:-------:|:------:|:-------------:|:----------------------:|:----------------------:|:----------------------:|:----------------------:|:---------------------:|
951
+ | 1.0 | 3 | - | 0.6688 | 0.6577 | 0.6456 | 0.6018 | 0.5202 |
952
+ | 2.0 | 6 | - | 0.6652 | 0.6624 | 0.6634 | 0.6147 | 0.5320 |
953
+ | 3.0 | 9 | - | 0.6868 | 0.6831 | 0.6678 | 0.6330 | 0.5537 |
954
+ | 3.3478 | 10 | 39.9839 | - | - | - | - | - |
955
+ | **4.0** | **12** | **-** | **0.6932** | **0.6862** | **0.6726** | **0.6397** | **0.5603** |
956
+
957
+ * The bold row denotes the saved checkpoint.
958
+
959
+ ### Framework Versions
960
+ - Python: 3.11.11
961
+ - Sentence Transformers: 4.1.0
962
+ - Transformers: 4.52.3
963
+ - PyTorch: 2.7.0+cu126
964
+ - Accelerate: 1.7.0
965
+ - Datasets: 3.6.0
966
+ - Tokenizers: 0.21.1
967
+
968
+ ## Citation
969
+
970
+ ### BibTeX
971
+
972
+ #### Sentence Transformers
973
+ ```bibtex
974
+ @inproceedings{reimers-2019-sentence-bert,
975
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
976
+ author = "Reimers, Nils and Gurevych, Iryna",
977
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
978
+ month = "11",
979
+ year = "2019",
980
+ publisher = "Association for Computational Linguistics",
981
+ url = "https://arxiv.org/abs/1908.10084",
982
+ }
983
+ ```
984
+
985
+ #### MatryoshkaLoss
986
+ ```bibtex
987
+ @misc{kusupati2024matryoshka,
988
+ title={Matryoshka Representation Learning},
989
+ author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
990
+ year={2024},
991
+ eprint={2205.13147},
992
+ archivePrefix={arXiv},
993
+ primaryClass={cs.LG}
994
+ }
995
+ ```
996
+
997
+ #### MultipleNegativesRankingLoss
998
+ ```bibtex
999
+ @misc{henderson2017efficient,
1000
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
1001
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
1002
+ year={2017},
1003
+ eprint={1705.00652},
1004
+ archivePrefix={arXiv},
1005
+ primaryClass={cs.CL}
1006
+ }
1007
+ ```
1008
+
1009
+ <!--
1010
+ ## Glossary
1011
+
1012
+ *Clearly define terms in order to be accessible across audiences.*
1013
+ -->
1014
+
1015
+ <!--
1016
+ ## Model Card Authors
1017
+
1018
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
1019
+ -->
1020
+
1021
+ <!--
1022
+ ## Model Card Contact
1023
+
1024
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
1025
+ -->
config.json ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "GteModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.0,
6
+ "auto_map": {
7
+ "AutoConfig": "Snowflake/snowflake-arctic-embed-m-v2.0--configuration_hf_alibaba_nlp_gte.GteConfig",
8
+ "AutoModel": "Snowflake/snowflake-arctic-embed-m-v2.0--modeling_hf_alibaba_nlp_gte.GteModel"
9
+ },
10
+ "classifier_dropout": 0.1,
11
+ "hidden_act": "gelu",
12
+ "hidden_dropout_prob": 0.1,
13
+ "hidden_size": 768,
14
+ "initializer_range": 0.02,
15
+ "intermediate_size": 3072,
16
+ "layer_norm_eps": 1e-12,
17
+ "layer_norm_type": "layer_norm",
18
+ "logn_attention_clip1": false,
19
+ "logn_attention_scale": false,
20
+ "matryoshka_dimensions": [
21
+ 256
22
+ ],
23
+ "max_position_embeddings": 8192,
24
+ "model_type": "gte",
25
+ "num_attention_heads": 12,
26
+ "num_hidden_layers": 12,
27
+ "pack_qkv": true,
28
+ "pad_token_id": 1,
29
+ "position_embedding_type": "rope",
30
+ "rope_scaling": null,
31
+ "rope_theta": 160000,
32
+ "torch_dtype": "float32",
33
+ "transformers_version": "4.52.3",
34
+ "type_vocab_size": 1,
35
+ "unpad_inputs": "true",
36
+ "use_memory_efficient_attention": "true",
37
+ "vocab_size": 250048
38
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "4.1.0",
4
+ "transformers": "4.52.3",
5
+ "pytorch": "2.7.0+cu126"
6
+ },
7
+ "prompts": {
8
+ "query": "query: "
9
+ },
10
+ "default_prompt_name": null,
11
+ "similarity_fn_name": "cosine"
12
+ }
eval/Information-Retrieval_evaluation_dim_128_results.csv ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ epoch,steps,cosine-Accuracy@1,cosine-Accuracy@3,cosine-Accuracy@5,cosine-Accuracy@10,cosine-Precision@1,cosine-Recall@1,cosine-Precision@3,cosine-Recall@3,cosine-Precision@5,cosine-Recall@5,cosine-Precision@10,cosine-Recall@10,cosine-MRR@10,cosine-NDCG@10,cosine-MAP@100
2
+ 1.0,3,0.38271604938271603,0.6296296296296297,0.7160493827160493,0.8271604938271605,0.38271604938271603,0.38271604938271603,0.20987654320987653,0.6296296296296297,0.14320987654320985,0.7160493827160493,0.08271604938271604,0.8271604938271605,0.5305310601606898,0.6021955458801781,0.5382570699169317
3
+ 2.0,6,0.3888888888888889,0.654320987654321,0.7654320987654321,0.8395061728395061,0.3888888888888889,0.3888888888888889,0.21810699588477364,0.654320987654321,0.1530864197530864,0.7654320987654321,0.0839506172839506,0.8395061728395061,0.5416103272584755,0.6140910300966435,0.5495464886177235
4
+ 3.0,9,0.4074074074074074,0.6851851851851852,0.7654320987654321,0.8641975308641975,0.4074074074074074,0.4074074074074074,0.22839506172839505,0.6851851851851852,0.1530864197530864,0.7654320987654321,0.08641975308641975,0.8641975308641975,0.5579389574759945,0.6321441276006302,0.5642732177438856
5
+ 4.0,12,0.41358024691358025,0.691358024691358,0.7777777777777778,0.8641975308641975,0.41358024691358025,0.41358024691358025,0.23045267489711932,0.691358024691358,0.15555555555555553,0.7777777777777778,0.08641975308641975,0.8641975308641975,0.5667793454830491,0.6391246452415552,0.5733425105038232
6
+ 1.0,3,0.38271604938271603,0.6296296296296297,0.7098765432098766,0.8271604938271605,0.38271604938271603,0.38271604938271603,0.20987654320987653,0.6296296296296297,0.14197530864197527,0.7098765432098766,0.08271604938271604,0.8271604938271605,0.5301268861454047,0.6018461575195746,0.5377603400581437
7
+ 2.0,6,0.3888888888888889,0.6604938271604939,0.7592592592592593,0.8395061728395061,0.3888888888888889,0.3888888888888889,0.22016460905349794,0.6604938271604939,0.15185185185185185,0.7592592592592593,0.0839506172839506,0.8395061728395061,0.5419189692337841,0.6143297820830638,0.5498389832031331
8
+ 1.0,3,0.38271604938271603,0.6296296296296297,0.7098765432098766,0.8271604938271605,0.38271604938271603,0.38271604938271603,0.20987654320987653,0.6296296296296297,0.1419753086419753,0.7098765432098766,0.08271604938271604,0.8271604938271605,0.5300852439741329,0.6018097174751208,0.5377970581405588
9
+ 2.0,6,0.3888888888888889,0.6666666666666666,0.7654320987654321,0.8395061728395061,0.3888888888888889,0.3888888888888889,0.2222222222222222,0.6666666666666666,0.1530864197530864,0.7654320987654321,0.0839506172839506,0.8395061728395061,0.5424064275916128,0.6147165742260062,0.550314018558461
10
+ 3.0,9,0.4074074074074074,0.6851851851851852,0.7654320987654321,0.8641975308641975,0.4074074074074074,0.4074074074074074,0.22839506172839505,0.6851851851851852,0.1530864197530864,0.7654320987654321,0.08641975308641975,0.8641975308641975,0.5589677640603566,0.6329523359560095,0.5653216884828832
11
+ 4.0,12,0.41358024691358025,0.6851851851851852,0.7777777777777778,0.8703703703703703,0.41358024691358025,0.41358024691358025,0.22839506172839505,0.6851851851851852,0.15555555555555553,0.7777777777777778,0.08703703703703702,0.8703703703703703,0.5658534195571232,0.6396728651848874,0.5718872882660476
eval/Information-Retrieval_evaluation_dim_256_results.csv ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ epoch,steps,cosine-Accuracy@1,cosine-Accuracy@3,cosine-Accuracy@5,cosine-Accuracy@10,cosine-Precision@1,cosine-Recall@1,cosine-Precision@3,cosine-Recall@3,cosine-Precision@5,cosine-Recall@5,cosine-Precision@10,cosine-Recall@10,cosine-MRR@10,cosine-NDCG@10,cosine-MAP@100
2
+ 1.0,3,0.4074074074074074,0.7098765432098766,0.7901234567901234,0.8827160493827161,0.4074074074074074,0.4074074074074074,0.23662551440329216,0.7098765432098766,0.1580246913580247,0.7901234567901234,0.0882716049382716,0.8827160493827161,0.5694395453654711,0.645656331681653,0.5748532909188923
3
+ 2.0,6,0.42592592592592593,0.7222222222222222,0.8024691358024691,0.8950617283950617,0.42592592592592593,0.42592592592592593,0.24074074074074073,0.7222222222222222,0.16049382716049382,0.8024691358024691,0.08950617283950617,0.8950617283950617,0.5890113658632177,0.6635896599917582,0.5938310269670422
4
+ 3.0,9,0.43209876543209874,0.7345679012345679,0.8271604938271605,0.8888888888888888,0.43209876543209874,0.43209876543209874,0.24485596707818924,0.7345679012345679,0.16543209876543208,0.8271604938271605,0.08888888888888888,0.8888888888888888,0.5959680580050949,0.668083339265037,0.6015420392173096
5
+ 4.0,12,0.4382716049382716,0.7345679012345679,0.8271604938271605,0.8950617283950617,0.4382716049382716,0.4382716049382716,0.24485596707818924,0.7345679012345679,0.16543209876543208,0.8271604938271605,0.08950617283950617,0.8950617283950617,0.599252890456594,0.6717650853475061,0.6045643524656577
6
+ 1.0,3,0.4074074074074074,0.7098765432098766,0.7901234567901234,0.8827160493827161,0.4074074074074074,0.4074074074074074,0.23662551440329216,0.7098765432098766,0.1580246913580247,0.7901234567901234,0.0882716049382716,0.8827160493827161,0.569353811483441,0.6455672274771977,0.5747458795870237
7
+ 2.0,6,0.42592592592592593,0.7222222222222222,0.8024691358024691,0.8950617283950617,0.42592592592592593,0.42592592592592593,0.24074074074074073,0.7222222222222222,0.16049382716049382,0.8024691358024691,0.08950617283950617,0.8950617283950617,0.5890113658632177,0.6635896599917582,0.5937888489324974
8
+ 3.0,9,0.43209876543209874,0.7345679012345679,0.8271604938271605,0.8888888888888888,0.43209876543209874,0.43209876543209874,0.24485596707818924,0.7345679012345679,0.16543209876543208,0.8271604938271605,0.08888888888888888,0.8888888888888888,0.5956594160297862,0.6678128222845503,0.6012364493420029
9
+ 1.0,3,0.4074074074074074,0.7098765432098766,0.7901234567901234,0.8827160493827161,0.4074074074074074,0.4074074074074074,0.23662551440329216,0.7098765432098766,0.1580246913580247,0.7901234567901234,0.0882716049382716,0.8827160493827161,0.569353811483441,0.6455672274771977,0.5747701815434549
10
+ 2.0,6,0.42592592592592593,0.7160493827160493,0.8024691358024691,0.8950617283950617,0.42592592592592593,0.42592592592592593,0.23868312757201646,0.7160493827160493,0.16049382716049382,0.8024691358024691,0.08950617283950617,0.8950617283950617,0.5888056045463451,0.6634322544912167,0.5936225951593602
11
+ 3.0,9,0.43209876543209874,0.7345679012345679,0.8271604938271605,0.8888888888888888,0.43209876543209874,0.43209876543209874,0.24485596707818924,0.7345679012345679,0.16543209876543208,0.8271604938271605,0.08888888888888888,0.8888888888888888,0.5956594160297862,0.6678128222845503,0.6012907092614589
12
+ 4.0,12,0.4382716049382716,0.7345679012345679,0.8271604938271605,0.8950617283950617,0.4382716049382716,0.4382716049382716,0.24485596707818924,0.7345679012345679,0.16543209876543208,0.8271604938271605,0.08950617283950617,0.8950617283950617,0.6002816970409561,0.6725732937028854,0.6055931590500198
eval/Information-Retrieval_evaluation_dim_512_results.csv ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ epoch,steps,cosine-Accuracy@1,cosine-Accuracy@3,cosine-Accuracy@5,cosine-Accuracy@10,cosine-Precision@1,cosine-Recall@1,cosine-Precision@3,cosine-Recall@3,cosine-Precision@5,cosine-Recall@5,cosine-Precision@10,cosine-Recall@10,cosine-MRR@10,cosine-NDCG@10,cosine-MAP@100
2
+ 1.0,3,0.41358024691358025,0.7037037037037037,0.7901234567901234,0.8827160493827161,0.41358024691358025,0.41358024691358025,0.2345679012345679,0.7037037037037037,0.1580246913580247,0.7901234567901234,0.08827160493827159,0.8827160493827161,0.5791544189692339,0.6530314161476585,0.5847635261609861
3
+ 2.0,6,0.42592592592592593,0.7160493827160493,0.8148148148148148,0.9074074074074074,0.42592592592592593,0.42592592592592593,0.23868312757201646,0.7160493827160493,0.16296296296296295,0.8148148148148148,0.09074074074074073,0.9074074074074074,0.5879850088183423,0.6654705449887022,0.5916081345311078
4
+ 3.0,9,0.4506172839506173,0.7530864197530864,0.8271604938271605,0.9074074074074074,0.4506172839506173,0.4506172839506173,0.2510288065843621,0.7530864197530864,0.16543209876543208,0.8271604938271605,0.09074074074074073,0.9074074074074074,0.6111086615716245,0.6837877914235437,0.6149641889523849
5
+ 4.0,12,0.4506172839506173,0.7530864197530864,0.8271604938271605,0.9074074074074074,0.4506172839506173,0.4506172839506173,0.2510288065843621,0.7530864197530864,0.16543209876543208,0.8271604938271605,0.09074074074074073,0.9074074074074074,0.6141950813247109,0.6862124164896819,0.618138129001409
6
+ 1.0,3,0.41975308641975306,0.7037037037037037,0.7901234567901234,0.8827160493827161,0.41975308641975306,0.41975308641975306,0.2345679012345679,0.7037037037037037,0.1580246913580247,0.7901234567901234,0.08827160493827159,0.8827160493827161,0.5825494806976288,0.6555801445258522,0.5881869785447996
7
+ 2.0,6,0.41975308641975306,0.7160493827160493,0.8148148148148148,0.9074074074074074,0.41975308641975306,0.41975308641975306,0.23868312757201646,0.7160493827160493,0.16296296296296295,0.8148148148148148,0.09074074074074073,0.9074074074074074,0.5848985890652558,0.6631923335909952,0.5885231233926944
8
+ 3.0,9,0.4567901234567901,0.7469135802469136,0.8271604938271605,0.9074074074074074,0.4567901234567901,0.4567901234567901,0.24897119341563784,0.7469135802469136,0.16543209876543208,0.8271604938271605,0.09074074074074073,0.9074074074074074,0.6147094846168919,0.6864462886956018,0.61858004728435
9
+ 1.0,3,0.41975308641975306,0.7037037037037037,0.7901234567901234,0.8888888888888888,0.41975308641975306,0.41975308641975306,0.2345679012345679,0.7037037037037037,0.1580246913580247,0.7901234567901234,0.08888888888888886,0.8888888888888888,0.5835856359004508,0.6577453114005705,0.5886596467084392
10
+ 2.0,6,0.41975308641975306,0.7160493827160493,0.8148148148148148,0.9074074074074074,0.41975308641975306,0.41975308641975306,0.23868312757201646,0.7160493827160493,0.16296296296296295,0.8148148148148148,0.09074074074074073,0.9074074074074074,0.5838697824808937,0.6623841252356157,0.5874755864501471
11
+ 3.0,9,0.4506172839506173,0.7469135802469136,0.8271604938271605,0.9074074074074074,0.4506172839506173,0.4506172839506173,0.24897119341563784,0.7469135802469136,0.16543209876543208,0.8271604938271605,0.09074074074074073,0.9074074074074074,0.6102856163041347,0.6830893519620286,0.6141635115406537
12
+ 4.0,12,0.4506172839506173,0.7530864197530864,0.8271604938271605,0.9074074074074074,0.4506172839506173,0.4506172839506173,0.2510288065843621,0.7530864197530864,0.16543209876543208,0.8271604938271605,0.09074074074074073,0.9074074074074074,0.6141950813247109,0.6862124164896819,0.618139737272647
eval/Information-Retrieval_evaluation_dim_64_results.csv ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ epoch,steps,cosine-Accuracy@1,cosine-Accuracy@3,cosine-Accuracy@5,cosine-Accuracy@10,cosine-Precision@1,cosine-Recall@1,cosine-Precision@3,cosine-Recall@3,cosine-Precision@5,cosine-Recall@5,cosine-Precision@10,cosine-Recall@10,cosine-MRR@10,cosine-NDCG@10,cosine-MAP@100
2
+ 1.0,3,0.29012345679012347,0.5555555555555556,0.6358024691358025,0.7592592592592593,0.29012345679012347,0.29012345679012347,0.18518518518518517,0.5555555555555556,0.1271604938271605,0.6358024691358025,0.07592592592592592,0.7592592592592593,0.44443219674701184,0.5202045641791371,0.4516865012402207
3
+ 2.0,6,0.3148148148148148,0.5617283950617284,0.6728395061728395,0.7716049382716049,0.3148148148148148,0.3148148148148148,0.18724279835390945,0.5617283950617284,0.13456790123456788,0.6728395061728395,0.07716049382716049,0.7716049382716049,0.4595507544581621,0.53470631928077,0.4668547556983398
4
+ 3.0,9,0.3333333333333333,0.5740740740740741,0.6790123456790124,0.7839506172839507,0.3333333333333333,0.3333333333333333,0.19135802469135801,0.5740740740740741,0.13580246913580246,0.6790123456790124,0.07839506172839504,0.7839506172839507,0.4795732902214384,0.5530355054744915,0.486601128663717
5
+ 4.0,12,0.345679012345679,0.5802469135802469,0.6851851851851852,0.7901234567901234,0.345679012345679,0.345679012345679,0.19341563786008228,0.5802469135802469,0.13703703703703704,0.6851851851851852,0.07901234567901233,0.7901234567901234,0.48726484420928873,0.56013834475686,0.4939964614452466
6
+ 1.0,3,0.29012345679012347,0.5617283950617284,0.6296296296296297,0.7592592592592593,0.29012345679012347,0.29012345679012347,0.18724279835390945,0.5617283950617284,0.1259259259259259,0.6296296296296297,0.07592592592592592,0.7592592592592593,0.4458553791887127,0.521340628725392,0.453081691903953
7
+ 2.0,6,0.3148148148148148,0.5617283950617284,0.6728395061728395,0.7716049382716049,0.3148148148148148,0.3148148148148148,0.18724279835390945,0.5617283950617284,0.13456790123456788,0.6728395061728395,0.07716049382716049,0.7716049382716049,0.4596977268273567,0.5348475159090087,0.46692789473198437
8
+ 1.0,3,0.29012345679012347,0.5617283950617284,0.6296296296296297,0.7592592592592593,0.29012345679012347,0.29012345679012347,0.18724279835390945,0.5617283950617284,0.1259259259259259,0.6296296296296297,0.07592592592592592,0.7592592592592593,0.44453507740544795,0.5202217389554445,0.45181102007783713
9
+ 2.0,6,0.30864197530864196,0.5555555555555556,0.6728395061728395,0.7716049382716049,0.30864197530864196,0.30864197530864196,0.18518518518518517,0.5555555555555556,0.13456790123456788,0.6728395061728395,0.07716049382716049,0.7716049382716049,0.4559866745051933,0.5320310829157818,0.46317311303593856
10
+ 3.0,9,0.3333333333333333,0.5740740740740741,0.6790123456790124,0.7901234567901234,0.3333333333333333,0.3333333333333333,0.19135802469135801,0.5740740740740741,0.13580246913580246,0.6790123456790124,0.07901234567901233,0.7901234567901234,0.4788016852831669,0.5537221096806089,0.48526467818974156
11
+ 4.0,12,0.345679012345679,0.5802469135802469,0.6851851851851852,0.7901234567901234,0.345679012345679,0.345679012345679,0.19341563786008228,0.5802469135802469,0.13703703703703704,0.6851851851851852,0.07901234567901233,0.7901234567901234,0.48750489907897315,0.5603350026673091,0.49429085785187665
eval/Information-Retrieval_evaluation_dim_768_results.csv ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ epoch,steps,cosine-Accuracy@1,cosine-Accuracy@3,cosine-Accuracy@5,cosine-Accuracy@10,cosine-Precision@1,cosine-Recall@1,cosine-Precision@3,cosine-Recall@3,cosine-Precision@5,cosine-Recall@5,cosine-Precision@10,cosine-Recall@10,cosine-MRR@10,cosine-NDCG@10,cosine-MAP@100
2
+ 1.0,3,0.4382716049382716,0.7222222222222222,0.8024691358024691,0.8950617283950617,0.4382716049382716,0.4382716049382716,0.24074074074074073,0.7222222222222222,0.16049382716049382,0.8024691358024691,0.08950617283950615,0.8950617283950617,0.5928522437781698,0.6662075156307669,0.597623503737822
3
+ 2.0,6,0.41975308641975306,0.7283950617283951,0.8271604938271605,0.9012345679012346,0.41975308641975306,0.41975308641975306,0.242798353909465,0.7283950617283951,0.16543209876543208,0.8271604938271605,0.09012345679012344,0.9012345679012346,0.5901136586321772,0.6663417172853776,0.5943313477669211
4
+ 3.0,9,0.4444444444444444,0.7345679012345679,0.8395061728395061,0.9135802469135802,0.4444444444444444,0.4444444444444444,0.24485596707818924,0.7345679012345679,0.1679012345679012,0.8395061728395061,0.09135802469135802,0.9135802469135802,0.607363315696649,0.6821825540795592,0.6110287478548849
5
+ 4.0,12,0.4691358024691358,0.7469135802469136,0.845679012345679,0.9135802469135802,0.4691358024691358,0.4691358024691358,0.24897119341563784,0.7469135802469136,0.1691358024691358,0.845679012345679,0.09135802469135802,0.9135802469135802,0.6220483049186752,0.6931888396302245,0.625915955793166
6
+ 1.0,3,0.4382716049382716,0.7345679012345679,0.8024691358024691,0.8950617283950617,0.4382716049382716,0.4382716049382716,0.2448559670781893,0.7345679012345679,0.16049382716049382,0.8024691358024691,0.08950617283950615,0.8950617283950617,0.5951327650401725,0.6680529817242347,0.5999731063107839
7
+ 2.0,6,0.41975308641975306,0.7345679012345679,0.8271604938271605,0.9012345679012346,0.41975308641975306,0.41975308641975306,0.2448559670781893,0.7345679012345679,0.16543209876543208,0.8271604938271605,0.09012345679012344,0.9012345679012346,0.5907137958063885,0.6668587439708613,0.5949318205672114
8
+ 3.0,9,0.4567901234567901,0.7345679012345679,0.8395061728395061,0.9135802469135802,0.4567901234567901,0.4567901234567901,0.24485596707818924,0.7345679012345679,0.1679012345679012,0.8395061728395061,0.09135802469135802,0.9135802469135802,0.6136120909269057,0.686779193022766,0.617263754517914
9
+ 1.0,3,0.4382716049382716,0.7283950617283951,0.808641975308642,0.9012345679012346,0.4382716049382716,0.4382716049382716,0.242798353909465,0.7283950617283951,0.16172839506172837,0.808641975308642,0.09012345679012344,0.9012345679012346,0.5944052518126591,0.6688240150844199,0.5986260312608651
10
+ 2.0,6,0.41975308641975306,0.7222222222222222,0.8271604938271605,0.9012345679012346,0.41975308641975306,0.41975308641975306,0.24074074074074073,0.7222222222222222,0.16543209876543208,0.8271604938271605,0.09012345679012344,0.9012345679012346,0.5886561826376642,0.6651946906534251,0.5928812643052896
11
+ 3.0,9,0.4567901234567901,0.7345679012345679,0.845679012345679,0.9135802469135802,0.4567901234567901,0.4567901234567901,0.24485596707818924,0.7345679012345679,0.1691358024691358,0.845679012345679,0.09135802469135802,0.9135802469135802,0.6136561826376641,0.6868390431651261,0.6173396640342221
12
+ 4.0,12,0.4691358024691358,0.7469135802469136,0.845679012345679,0.9135802469135802,0.4691358024691358,0.4691358024691358,0.24897119341563784,0.7469135802469136,0.1691358024691358,0.845679012345679,0.09135802469135802,0.9135802469135802,0.6220483049186752,0.6931888396302245,0.6258770441533618
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:27157dd3963fc12375050e83b2d0de6822dfae67dbd95e85b1cccc3d118d19c0
3
+ size 1221487872
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 8192,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "<unk>",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aa7a6ad87a7ce8fe196787355f6af7d03aee94d19c54a5eb1392ed18c8ef451a
3
+ size 17082988
tokenizer_config.json ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<s>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<pad>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<unk>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "250001": {
36
+ "content": "<mask>",
37
+ "lstrip": true,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "<s>",
45
+ "clean_up_tokenization_spaces": true,
46
+ "cls_token": "<s>",
47
+ "eos_token": "</s>",
48
+ "extra_special_tokens": {},
49
+ "mask_token": "<mask>",
50
+ "max_length": 512,
51
+ "model_max_length": 32768,
52
+ "pad_to_multiple_of": null,
53
+ "pad_token": "<pad>",
54
+ "pad_token_type_id": 0,
55
+ "padding_side": "right",
56
+ "sep_token": "</s>",
57
+ "stride": 0,
58
+ "tokenizer_class": "XLMRobertaTokenizerFast",
59
+ "truncation_side": "right",
60
+ "truncation_strategy": "longest_first",
61
+ "unk_token": "<unk>"
62
+ }